1 CTCTTACTCT TCAGCCTGAT GTCAAAAGCA AAAGTTCAGA AGTTCCTCAT

51 CAATAAGGAG TCCTTGTGAG CAGGTGAAGC TCATCTAACT AGGCATTTCT 
101 ATGATGTGGC TGCTTTTAAC AACAACTTGT TTGATCTGTG GAACTTTAAA 
151 TGCTGGTGGA TTCCTTGATT TGGAAAATGA AGTGAATCCT GAGGTGTGGA 
201 TGAATACTAG TGAAATCATC ATCTACAATG GCTACCCCAG TGAAGAGTAT 
251 GAAGTCACCA CTGAAGATGG GTATATACTC CTTGTCAACA GAATTCCTTA 
301 TGGGCGAACA CATGCTAGGA GCACAGGTCC CCGGCCAGTT GTGTATATGC 
351 AGCATGCCCT GTTTGCAGAC AATGCCTACT GGCTTGAGAA TTATGCCAAT 
401 GGAAGCCTTG GATTCCTTCT AGCAGATGCA GGTTATGATG TATGGATGGG 
451 AAACAGTCGG GGAAACACTT GGTCAAGAAG ACACAAAACA CTCTCAGAGA 
501 CAGATGAGAA ATTCTGGGCC TTTAGTTTTG ATGAAATGGC CAAATATGAT 
551 CTCCCAGGAG TAATAGACTT CATTGTAAAT AAAACTGGTC AGGAGAAATT 
601 GTATTTCATT GGACATTCAC TTGGCACTAC AATAGGGTTT GTAGCCTTTT
651 CCACCATGCC TGAACTGGCA CAAAGAATCA AAATGAATTT TGCCTTGGGT 
701 CCTACGATCT CATTCAAATA TCCCACGGGC ATTTTTACCA GGTTTTTTCT 
751 ACTTCCAAAT TCCATAATCA AGGCTGTTTT TGGTACCAAA GGTTTCTTTT
801 TAGAAGATAA GAAAACGAAG ATAGCTTCTA CCAAAATCTG CAACAATAAG
851 ATACTCTGGT TGATATGTAG CGAATTTATG TCCTTATGGG CTGGATCCAA
901 CAAGAAAAAT ATGAATCAGA GTCGAATGGA TGTGTATATG TCACATGCTC
951 CCACTGGTTC ATCAGTACAG AACATTCTGC ATATAAAACA GCTTTACCAC
1001 TCTGATGAAT TCAGAGCTTA TGACTGGGGA AATGACGCTG ATAATATGAA
1051 ACATTACAAT CAGAGTCATC CCCCTATATA TGACCTGACT GCCATGAAAG
1101 TGCCTACTGC TATTTGGGCT GGTGGACATG ATGTCCTCGG AACACCCCAG
1151 GATGTGGCCA GGATACTCCC TCAAATCAAG AGTCTTTCAT TAGTGCTAAG
1201 CCTATTGCCA GAATGGGAAC CCACCTTTGA TTTTGTCTGG GGCCTTGATG
1251 CCCCTCAACG GATGTTCAGT GGAAATCATA ACCTTTAATG AAGGCATATT
1301 TCCTAAATGC CAATGCATTT TACCTTTTTC AATTTAAAGG TTGGTTTCCA
1351 AAGCCCTTAC 
(SEQ ID NO: 1) 

FEATURES:

5'UTR: 1-100 
Start Codon: 101 
Stop Codon: 1286 
3'UTR: 1289 
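
The feature coordinates above imply the size of the coding region. The following is a minimal sketch of that arithmetic, assuming the listed start and stop values are the 1-based positions of the first base of each codon; the function name `coding_summary` is illustrative and not part of the original listing.

```python
# Feature coordinates as listed for SEQ ID NO: 1 (assumed 1-based,
# each pointing at the first base of the codon).
START_CODON = 101
STOP_CODON = 1286


def coding_summary(start, stop):
    """Return (cds_length_nt, residues, first_3utr_base).

    cds_length_nt includes the stop codon; residues excludes it.
    """
    span = stop - start
    if span % 3 != 0:
        raise ValueError("start and stop codons are not in the same frame")
    residues = span // 3          # codons before the stop codon
    cds_length_nt = span + 3      # add the three bases of the stop codon
    first_3utr_base = stop + 3    # first base after the stop codon
    return cds_length_nt, residues, first_3utr_base


print(coding_summary(START_CODON, STOP_CODON))  # (1188, 395, 1289)
```

Under these assumptions the open reading frame encodes 395 residues and the 3'UTR begins at position 1289, which agrees with the listed 3'UTR coordinate.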

Homologous proteins: 
Top 10 BLAST Hits: 

CRA|18000004922653 /altid=gi|7434997 /def=pir||G01416 lysosomal ... 431 e-120
CRA|18000004903706 /altid=gi|542751 /def=pir||S41408 lysosomal ... 430 e-119
CRA|18000004924799 /altid=gi|4557721 /def=ref|NP_000226.1| lipa... 428 e-119
CRA|98000043616611 /altid=gi|12844223 /def=dbj|BAB26283.1| (AK0... 415 e-115
CRA|98000043617058 /altid=gi|12845127 /def=dbj|BAB26629.1| (AK0... 415 e-115
CRA|98000043616593 /altid=gi|12844194 /def=dbj|BAB26272.1| (AK0... 414 e-115



FIG.1A 






Docket No.: CL001186DIV 
Serial No.: (to be assigned) 
Inventors: Gennady V. MERKULOV et al. 
Title: ISOLATED HUMAN LIPASE PROTEINS, 



CRA|98000043617174 /altid=gi|12845372 /def=dbj|BAB26725.1| (AK0... 414 e-115
CRA|98000043617140 /altid=gi|12845298 /def=dbj|BAB26697.1| (AK0... 414 e-115
CRA|98000043617224 /altid=gi|12845477 /def=dbj|BAB26766.1| (AK0... 414 e-114
CRA|98000043616955 /altid=gi|12844939 /def=dbj|BAB26556.1| (AK0... 414 e-114

EST:

gi|8003062 /dataset=dbest /taxon=960... 62 4e-07
gi|8000757 /dataset=dbest /taxon=960... 54 9e-05

EXPRESSION INFORMATION FOR MODULATORY USE: 

gi|8003062 Stomach normal
gi|8000757 Stomach normal

Tissue expression: 
Human leukocyte 
1 MMWLLLTTTC LICGTLNAGG FLDLENEVNP EVWMNTSEII IYNGYPSEEY
51 EVTTEDGYIL LVNRIPYGRT HARSTGPRPV VYMQHALFAD NAYWLENYAN 
101 GSLGFLLADA GYDVWMGNSR GNTWSRRHKT LSETDEKFWA FSFDEMAKYD 
151 LPGVIDFIVN KTGQEKLYFI GHSLGTTIGF VAFSTMPELA QRIKMNFALG
201 PTISFKYPTG IFTRFFLLPN SIIKAVFGTK GFFLEDKKTK IASTKICNNK
251 ILWLICSEFM SLWAGSNKKN MNQSRMDVYM SHAPTGSSVH NILHIKQLYH
301 SDEFRAYDWG NDADNMKHYN QSHPPIYDLT AMKVPTAIWA GGHDVLGTPQ 
351 DVARILPQIK SLSLVLSLLP EWEPTFDFVW GLDAPQRMFS GNHNL 
(SEQ ID NO: 2) 
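
The amino-terminal residues can be checked against the coding region of SEQ ID NO: 1. The sketch below translates the first thirty bases of the open reading frame (positions 101-130 of SEQ ID NO: 1) with the standard genetic code; the helper name `translate` and the compact codon-table construction are illustrative assumptions, not part of the original listing.

```python
from itertools import product

# Standard genetic code, enumerated with bases in T, C, A, G order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna):
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = dna.upper().replace(" ", "")
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# Positions 101-130 of SEQ ID NO: 1, copied from the listing.
cds_start = "ATGATGTGGC TGCTTTTAAC AACAACTTGT"
print(translate(cds_start))  # MMWLLLTTTC
```

The same function can be applied to the full span from the start codon through the stop codon to compare the conceptual translation with the listed protein residue by residue.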

FEATURES: 

Functional domains and key regions: 

[1] PDOC00001 PS00001 ASN_GLYCOSYLATION

N-glycosylation site 

Number of matches: 5 

1 35-38 NTSE 

2 100-103 NGSL 

3 160-163 NKTG 

4 272-275 NQSR 

5 320-323 NQSH 
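
The N-glycosylation matches listed above follow the PROSITE PS00001 consensus N-{P}-[ST]-{P}. The sketch below shows one way such a scan could be reproduced with a regular expression; the function name `find_sites` and the `first_residue` parameter are hypothetical, and the ten-residue fragment is the translation of positions 191-220 of SEQ ID NO: 1.

```python
import re

# PROSITE PS00001 (ASN_GLYCOSYLATION): N-{P}-[ST]-{P}.
# The zero-width lookahead allows overlapping candidates to be reported.
N_GLYC = re.compile(r"(?=(N[^P][ST][^P]))")


def find_sites(seq, first_residue=1):
    """Return (start, end, motif) tuples with 1-based inclusive numbering."""
    sites = []
    for match in N_GLYC.finditer(seq):
        start = first_residue + match.start()
        sites.append((start, start + 3, match.group(1)))
    return sites


# Residues 31-40 of the protein (translated from bases 191-220 of SEQ ID NO: 1).
fragment = "EVWMNTSEII"
print(find_sites(fragment, first_residue=31))  # [(35, 38, 'NTSE')]
```

Run over a full-length protein string with `first_residue=1`, the function reports every candidate in the same start-end-motif form used in the list above. A pattern match only marks a potential site and says nothing about whether the residue is glycosylated in vivo.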



[2] PDOC00005 PS00005 PKC_PHOSPHO_SITE
Protein kinase C phosphorylation site 

Number of matches: 4 

1 125-127 SRR 

2 204-206 SFK 

3 243-245 STK 

4 266-268 SNK 



[3] PDOC00006 PS00006 CK2_PHOSPHO_SITE
Casein kinase II phosphorylation site 

Number of matches: 8 

1 53-56 TTED 

2 130-133 TLSE 

3 132-135 SETD 

4 142-145 SFDE 

5 162-165 TGQE 

6 185-188 TMPE 

7 274-277 SRMD 

8 348-351 TPQD 
[4] PDOC00007 PS00007 TYR_PHOSPHO_SITE
Tyrosine kinase phosphorylation site 



161-168 KTGQEKLY 



[5] PDOC00008 PS00008 MYRISTYL 
N-myristoylation site 

Number of matches: 4 

1 14-19 GTLNAG 

2 117-122 GNSRGN 

3 121-126 GNTWSR 

4 175-180 GTTIGF



[6] PDOC00110 PS00120 LIPASE_SER
Lipases, serine active site 

167-176 LYFIGHSLGT



Membrane spanning structure and domains: 
Helix  Begin  End  Score  Certainty
1      3      23   1.398  Certain
2      167    187  1.637  Certain
3      248    268  0.715  Putative

BLAST Alignment to Top Hit:

>CRA|18000004903706 /altid=gi|542751 /def=pir||S41408 lysosomal acid
lipase (EC 3.1.1.-) / sterol esterase (EC 3.1.1.13)
precursor - human /org=human /taxon=9606 /dataset=nraa
/length=399
Length = 399

Score = 430 bits (1094), Expect = e-119 

Identities = 211/394 (53%), Positives = 274/394 (68%), Gaps = 2/394 (0%) 
Query: 2 mwllltttclicgtlnaggfldlenew^ 61 

M L CL+ TL++ G V+PE MN SEII Y G+PSEEY V TEDGYIL 

Sbjct: 3 MRFLGLWCLVLlvTWSEGSGGKLTAVDPEThD^ 62 

Query: 62 vnripygrtharstgprpvvymqhalfaw 121 

+NRIP4GR + GP+PW++QH L AD++ Wf N AN S LGF+LADAG+DVWGNSRG 
Sbjct: 63 WRIPHGRKNHSDKGPKPWFLQHGLl^SSN^ 122 
Query: 122
            NTWSR+HKTLS + ++FWAFS+OEMAKYDLP I+FI+NKTGQE++Y++GHS GT7TGF+
Sbjct: 123

Query: 182
            AFS +PELA+RIKM FALGP S + T + LP+ +IK +FG K F + K
Sbjct: 183

Query: 242
            T +C + IL +C L G Nf+N+N SR+DVY +H+P G+SV N+LH Q
Sbjct: 243

Query: 302
            +F+A+DWG+ A N HYNQS+PP Y++ M VPTA+WfGGHD L DV +L QI +
Sbjct: 303

Query: 362
            S +PEWE DF+WGLDAP R+++ NL
Sbjct: 363



Hmmer search results (Pfam):

Scores for sequence family classification (score includes all domains):

Model    Description                Score  E-value  N
PF00561  alpha/beta hydrolase fold  46.7   2.5e-13  2

Parsed for domains:

Model    Domain  seq-f  seq-t   hmm-f  hmm-t   score  E-value
PF00561  1/2     112    195 ..  1      71 [.   38.8   6.7e-11
PF00561  2/2     294    352 ..  139    196 ..  8.0    0.19
1 TTATGGCCTA ACL I I I I IM CTTTGAGTTA TTTTCAAGAG AAAATTTGAA
51 AAAGCAGCCT TTGAGGAGAA AGAAGCAATC CAACAAACAA AAAGATAACC 
101 ACACTGTAAT AGGAAATGTG TTTTGAATAG GACATTGGAA GAAAAATAAT 
151 AATCAI I I I I ACAGGTAGAT CCCAAAGTCA AGGATCTATG TTCAACCATG 
201 TGTGTTCCAC CATCTTCACA ATTGAATGAG TAACCATCAT TAAGCAGTTA 
251 GCTTAGGCCG TAATATGATT CTTGGACTGA GATTTCAAAA ATACCACAGG 
301 CCTTCTGAAA GGTTACCCCT TTCTAGCTCC ACTATCATCT AATTTTATTA 
351 AAAAAAAAAA AAAAGGAAAA ATTTGAGCTT CTAGAGAGTA GGGGCTACCA 
401 TTTTGTATCC CACAGGGCCA AGGAACAAGT TTTAATGTAT TCATTTAAAT 
451 TAATTTCAGT ATGAGTATTG AAATATATAA TAGAAATATT GTAACATTAT 
501 ATATTTTCTA TATACTTTTA TTATATAGAA AATATATATT ACAGAATATA 
551 TTATTAAATA TTGTAGAACA ATATATAATA CAGAAAAATA TATAATACTC 
601 AGTAATATAT TAAATACTTA TTAAAATAGC AAGCTTATAT AGGAAGAGTG 
651 ATGGAGCATT GTGAGAAAGT TTCAGCTTTA TTTCTTTGAC ATTACTTTGT 
701 TTCTGCACAA ACAAAAGAAT TACAGGAATT GTCCAGATTA TTCAAATAAC 
751 TCGAAGTTGA GGAGGGAATA TAAGTCAATG ATGTAGAAAC TCTTTTAAGA
801 TTTGAGCTAG CCTACAATCT GTAAAGATCT GTGAAATTGA ACTATATTTG
851 TGCTATTTCC ATATTAAGTC AAGGCAACAA ATCAATATTA ATAATAATAA
901 CATAGCACTT CTAGAACTTT CTAAAGAGTC CAATAAAGTT TTGTTAGAAA
951 GGATTGTTTT TGAAGTTAAA AACCATGAGA AATTCCAGGA AAATCCACAT
1001 ACCTATGCCA TCATACTATC AATCAGGGCA AAACATGCTT GAGTCTTTCA
1051 TCAAGACTAA ATGATTAAGG AGTGGTACAT AACTTTTCCC TGTTCTGACT
1101 AGCTGAACAC TTCLI I I I AC TCCACATTTG TTTAATTGGC ATGAAATTTC 
1151 CCACTCCACT AAAACAGATC TTAGGATTTG GACAACACAA AATATCATTT 
1201 GTTTTGAAAG GATTTGAGGA TAAATCCAAA CTAATAGAAC TGAAACTTCT 
1251 ATATTATGCT GGGTAGCAAC TTAGI I I ICC CTACCCTTCT TCATGCTGGG
1301 AGATGAAAGA GATTCAGTTA CGGCTTAAGC TCCACAGGCA TACAAAGTGA
1351 AGCAGAAAAC TGAGGCACGT GTGCCTCCAT TATCTGGTAT CTCATGTGGG
1401 GCTTAGAGGT AAATTGTCGT TATTTGGCCT CCATTTCTGC CTTTAACCAC
1451 TGGTGTAAAC AAAGGTTACT GTGCCAAAGT TGACAGCAAC CCAAATCCCT 
1501 TTGGCATGTG AATTAGTTTC CTCTGCCATA CTGCTAGTTC CAAATTCCTT 
1551 CTGGTTTCAG GATTTAGGAG TCAGGGTTGC CTCATCTTCT CAAATGAGTT 
1601 ACAGTCACGC ACATCCCTAC ACACTGCATG GTTGGCACTA GTTCCTTGAT 
1651 ATATGTTACT CCGTTTGATC CTCATGAAGG ATCAAATGGG GAAGGGAGAT 
1701 ACTATTGTCT CTGATTGTCC ATTAAGATCT TGAGTATGTT CTACTTCCCT 
1751 GTTTGACACA CTGGTTTGAA AATGTTGCTA AGTCTTCCCA ACAATGACAG 
1801 ATACTCAGTG GAAACATGAA GGATTCCGTC AAACTGGTTA TTTTGCATCA 
1851 TGTAGACCAC TATTTCCCAA CCTGCAAGTG CATCATGGCC TTTGGTGTGT 
1901 CAGGGACACG CCTTGGGTGT GTGTCTCAGT CTAAAGCTTC CTCCTTTTCA 
1951 CAAGCTTCCT GTTTCTCATC TCTCTAGCTT CTAACTGTCA CTGTAATCAT 
2001 CTCTTACTCT TCAGCCTGAT GTCAAAAGCA AAAGTTCAGA AGTTCCTCAT 
2051 CAATAAGGAG TCCTTGTGAG CAGGTGAAGC TCATCTAACT AGGTAAGATG 
2101 AAGATCTATC ATAACCAGGA GGCAGGTTGG AAGGTGCCAG TTGCACTGGC 
2151 AGTCAGGTGC AAGAGCTCTG CAGTGAGGCT GCCTGAGTGT CCATCCTAGA 
2201 TCTCTCACCT CTTGGCTCTG TGACCTTGAG CAGGTCTTAA ATCTCTCTAA 
2251 GCCTTTGTTT TTTTAATTGA TAAAATGAGG ATAATAATAG TACCAAAATT
2301 AGGGAGATTT TCAGAGCTTA AATAACATAC GTGAACTATT TAGAGTAATG 
2351 CCTGCCATAA GGGGACTCAG TAGCTTATTA TTAGTTTCAT ACAATTTGAA 
2401 AAGTTTCATA ATATTTGCAG ATATAAGATG ATCTTCAACC AGATAGCTAA 
2451 TGTATGCAAA GCTATTTAGC TTCAGAAGTA AACTCTGCAT TTCTAGAAGT 
2501 TAAATATTAC I I IGI IATAG TGAATTATCT GTAATATTTA TCTCTTGCTC 
2551 ACTTTTATAA GAAAAATAGT GAAAGCATTT ATTAAGAACT TACACTGCAC 
2601 TAAATGTTAT ATATGACTTA ATCCTCACTA TAACCCTATG AGATAGGTTA 
2651 CATTATTGTC CTAATTTTAC TAACAAGGAA ACCAAGAGAC AAAGCTACTA 
2701 AAACACTTGC CTGAGGTTAG ACATCTTCTT CTGTGGTGAG GCTGGATTTC 
2751 AAATTTAGAC CATTTGACTG TAGCACTTAT ATGATGAGCA TGCTGTTTAG 
2801 TGTTATAGTG TTGGTCTACC TTTGAATAGA CATALI I I IA AACCATGGCA 
2851 AGGAAGTGAG ACTGCACATT GAAATATGTA AAATTTGCCT TTGGGTGCCA 
2901 CGTGAGAAAT AGTCACATCA CTAGAAACTA ATCATAAGCT TTTGTGTTTG 
2951 GTTAAAGTTT TATTGATCCA TTTTTCTTGT TTACTTTGTG GGATACTGGG 
3001 CTTAACTAGG GGATACCTCC AG I I I I ACT TGGCCATGGT ATGAAAACCT 
3051 GTCCTCTGAA TCTTTAGATA TTTTGGCAAA TTGTAGGCAA ACAAAGACTT
3101 AAAGCAATTC AACCTTGATT AAAATAAGAC CAAAAATGCC TCGATACTTG
3151 ATTAAATTTA TTTCATTTTA GGAACTGGAT TATAATCAAG ACAACTTCTA
3201 CATGAAAAAA TAGATTAATA GTGCTCCAAG TTAGTTCACT GTATTTATTC
3251 CTTTTTATAC ATTATCTGCC TTCGGTGTTA TTCAAGTTTT CATTAATCAT 
3301 TAATAATTTC ACTAATCATT TTATTTCATT AATCAACATT GATAGTTAAA 
3351 ATTAATCTGT GAATATTAAA TGTTTTATGC CAGGCATTTC TATGATGTGG 
3401 CTGCTTTTAA CAACAACTTG TTTGATCTGT GGAACTTTAA ATGCTGGTGG 
3451 ATTCCTTGAT TTGGAAAATG AAGTGAATCC TGAGGTGTGG ATGAATACTG 
3501 TAAGTCATGG AAAACTGTGA AGAACATCAA ATAAAGCAGG ACTAATGGAG 
3551 TATGAGGTTA CGAAAGGTCC TGTTGTAACA GAAAATCTCT GATAAAACAG 
3601 ATAAAATGTA GATGGI I I I I AACCTCTGCA AGAGTCAAGC TAGTTAGATC 
3651 TTTGTCTGAA AAACAAATAC TGTCCGGTAA TGAAAACCAA ATTGTGCTAT 
3701 TGTGCTATCT ATCTATCTAT CTATCTATCT ATCTATCTAT CTATCTATCT 
3751 ATCTATCTAT TTATCTATCT ATCTATAGAT AGAACCTCCT CI I I I GAATT 
3801 TATGTTTTAA GAATATCAAG CTATTTGTTG ATATACATGA TTGCCTTCTA 
3851 TTGATCTATA GTTCTATTAC TTTTAAAGCA AGAGGGGTCT CAAAAGACAA 
3901 TTGACTTGAT AATATAGCTT TGTCAGAAAG AATGGGTCAA TGCTAAATTT 
3951 TCCCCCAACC CCCCAAAATA TTAGCCAATA GTAGATATTT TTTAAAATTC 
4001 TACTTATTTT GTATTAAGAC TTTATTTATT AATTTTACAG TTACCTGGTG 
4051 CTACAAATTT CAGATAATTC ACCCTAATAA GCACACAACA GATGGTTTGT 
4101 TTTGATTCCT TTTTATATCC TTTGGAGAAG TTCCACTAAC GACTGTATTT 
4151 TTACTGGGCA GAGTGAAATC ATCATCTACA ATGGCTACCC CAGTGAAGAG 
4201 TATGAAGTCA CCACTGAAGA TGGGTATATA CTCCTTGTCA ACAGAATTCC 
4251 TTATGGGCGA ACACATGCTA GGAGCACAGG TACAAGATAT GTCTCTCCTG 
4301 AAAAGGGGAC TGCATTGACC TCCTGCTTCT CAGGAGGAAT TTAATGCTAG 
4351 ATATGCATCA ACAGAGTTTA TCAAAATTGG TTTGAATTAT TGGATTAGTC 
4401 TTTAAATAGT TATCAGGGAG GCTCACTCTT TGCCTGATAA TTCTCTGAAG 
4451 ACAGACAGGA ACCTAAAAAT ACAAACAGCA AGACTGATCT TGCTAACTGC 
4501 AACCAGAGGT ACTTGTTAGG GTGTAAACAG AAAGGCAGAG CCTGCATTTT
4551 GTCACCTCAT TACTGATTTA TCATGTGGAA AATTGCTTTG TCCCAGGAAA 
4601 ATGGATCCTC TCATTGTCAG AAGGAGATTT TCTAGGTTGT ATGAAATTGA 
4651 CTCTGGGGCA CCCAAGAAGA ACCTCTCCTG CTCCCACTAA AATTAAGGGG 
4701 CCTCCCTCTG CAGGATAAAA AACAATCTAG TTAAATGACA ACGCATTTCT 
4751 GAAAAGTTTT CCAGGACTGA AAACCTTAAC ATCCACATAC ACTTTGATCT 
4801 AAGGGACAGA CGGTTCATAG AATGAAAGAG TATGGTGTCA ATAAGGCTTG 
4851 AATTCTAGAA TGAGGAGCCA GCCATGCCAT AGCAGGGGAA TGATACTCCT 
4901 TAAAAGGGAA AATTTAACTA CAAATCCTCT GAAGTAGAAA TGATAAGAAT 
4951 AACCAAAATA TCTGCAATGG TTCAATAGCA AATAATTTAT TGGCAGCTGC 
5001 TTACCGTGTT CATTTTGCAT CI I 1 1 I ICCC ACCACACATA TTAAGGAGCA 
5051 GCTGAAGTCA TGTTTGACAT TCTCTCCCTC TTTTATCTCC AGTTTCAGAA 
5101 TGAAAAATGA GAGTGAGATA TGAGTAGTTT TACTAGTTAA AATATGAAAC 
5151 ACCCAGTTAA ATTTGAAGGT CAGATAAACA ACAAATAATT TTGTATAAGT 
5201 CTCATTTTAA GATAATACTA AAAAGTCATT ATTTATTCAC TATTATCACT 
5251 ATTTATAAAA TTTTGTAGAG CATCCTGGAT LI 1 1 1 I GOT AC I 1 1 IGI I I 
5301 TTAI 1 1 1 I IG CTAAATCTGG CAATCCCAGG CACATGTGTG AAGGAGCTGT 
5351 GAAATATAAA AGGAGAAAAC TTTTATGGGA AAGATTTGGC TTAAGGAGAG 
5401 ATAATTTTGG AAAGATTTAG AATTAAAGAT CATTCATTAG ATGTAATGTT 
5451 CTAAATACTT TATATCAGTT AAACTTCTCA TCAACAATAT GAGATGGGTA 
5501 CCACTAATAG TCACCATTTC ACAAATGATG AAATTAAGGC ACAACCGGTT 
5551 ATGTTAAGAG GCCTAAAGTC CACAAATAGC AAGCTGACAG ACCAGAATTT 
5601 AAGCCCAGGC ATGCTGGCTC CAGAGCCTGT GCTCTTAGTC ATTAAATTAT 
5651 AGTGCCTTAC TTGACCTTCC ACCCTGGTTA CTTTGGATCT CCCTGAATGC 
5701 TCTCTCTCCC TCAGAAATAC TGGAAGTTGG CAGAGGGACA CTGAGCTGAG 
5751 CATATTATTG TAG 1 1 1 1 IAA ATGCTCTCCA CTGGACAGAA GATGGGGGAT 
5801 TTGAATAGAA ATTTGGTGAG GAACTAATCA GTGTCCATTT ACACTCACCT 
5851 CCTCTTCCTC CCTGGAAGAG CTATAGGACT TGAGTAAGCA TGATAAATTT 
5901 CGTGTCTTTG TAAACCACAC CCAGGAAATT TGTATATACA AATACATAGA 
5951 GCACAGTAGT TATCAGGACA GACTTTGACA TAAAAAGAAC TGGGTTTGAG 
6001 TCCCTGCTCT GGCCTTCTTA TCTGGGTGGC CCTCTGGGAA AGTTACTTAA 
6051 CTACATAAAG I 1 1 IGI I ICC ATATCTACAA AATGAGGTTT CTCAAAATAG 
6101 CAGCTAGTTT ATAGAGTTGT TGCAAGAATT TAGTAAGCTA ATACATATAA 
6151 ATACGTCAAC ATAGCACCAG GTACAAAAAT ATGTGCTCAA GAAACTGAAG 
6201 TTACCTGATT ATAATGCTCT ATACTATTGA CAAGGGAAAA GTGAAAACAG 
6251 I 1 1 I IGI I I I ACCATGTGTG TATGTGTGTG TGTCTGTGAT GTTTCCGACA 
6301 TGCTCTATTT AACATAAATT ACTCTCACTC TTTCTCTCTC TCTCTTTCTC 
6351 TTTCTCCCTC TCTCATCTTA CCCTTTCCCC CACCAGGTCC CCGGCCAGTT 
6401 GTGTATATGC AGCATGCCCT GTTTGCAGAC AATGCCTACT GGCTTGAGAA 
6451 TTATGCCAAT GGAAGCCTTG GATTCCTTCT AGCAGATGCA GGTTATGATG 
6501 TATGGATGGG AAACAGTCGG GGAAACACTT GGTCAAGAAG ACACAAAACA 
6551 CTCTCAGAGA CAGATGAGAA ATTCTGGGCC TTTAGGTAAA TATTAGCTAA 
6601 GAAAACTCAA GGGGGAAATT GGAGGCAATT TTAAAAAAAT AACGTGGACG 
6651 CTATTAATGA TTATCTTTGA CGCTTGAAGT CATATAGCTC CTTGTAGTTT 
6701 CTGTTAAGAT CTCAAAGGAG GGTAACAGCA AGAAGCTCTG Al I I I ICACT 
6751 GATTCTCCCA CAAGCAAAGT ATGGCATTTC AACAAGATCA 1 1 I I IACATC
6801 CAATTCTGTG AATTCTATGC ATTAAAAGTA TGTCCAAAGA GACAGCTCAG 
6851 GAAATTATCA TGACCAATGT GCACATTCAT TCAGCCAATG TTTACTGAGT 
6901 GGCTACTGTA TGCGCTGTTC TAGGCCCCGA ACATTCAAAC AGGGAACAGA 
6951 CAAACTCTGA CCTCACAAAG CTTATGTTCA TTTTAGTGAT AATTTTACAA 
7001 GTCATTGCTC CTGGATTGCC AATCAACTGT GTAAAGATGA TTTGGACCAG 
7051 GACCTTATTG ATTTAGAGAA ACTGTGATTG ATTTAGAGAA ACTGAGATCG 
7101 CACATAGTAC CATTTTCAGG AAAACTCCAA TATTAGATTT TTAAAACCTT 
7151 GTTAATGGGC AATGAAGAAG AATCI I I I I I GATATCTTGT TTCTTTTAAT 
7201 GGAAGAGTTT TCTGCTGTCA CCAGAGGACA GGCTGATGCC TGCGATAGAC 
7251 1 1 1 IU I ICT TCAGGCCTAA GCTCCCTGTT GGTTTGTAAA CCTGATGCTA 
7301 GAACAGACTG TGTATTCCTA TTACATTAAT AAAACATTCA GTACCCACTG 
7351 AAAGTTTGAG AATAGTGGAG GAATAGAATA GAATGTTATA GTCTGAGTTC 
7401 TTGGGCAGGG GCAAGCATCA GGAAATATTG AATCATTAGT CTTTAGGAGG 
7451 TGTCACAACA ATTCTCCTAT TCTTGTAAGT CCCAATCTAT AGATTTCCTC 
7501 ACATGTTCTT TTAATAAACA GGCTTCTAGC TTATGGAATA CCTGATTTGA
7551 CTAAATGTTA TATAGGCCCT TTTGTTCCTC CTGTCTGAAG AACAAAATAC
7601 TAGTACTATG GAATATTGGT ATATATTAAA TATATATCTA TATATCCATG
7651 TGGACAGGAA TACTACTACT AACAACATCT TACTGAGCAC CCACTGGCAG
7701 CCAGAGTCGT TTCTTTCATA CTATTAAACC CCGTTAGCAG CCCCGTAAAC
7751 CAGGTACTAC CCTGTTTATT TCCCAAATGA GAAAACATAG GCTCAGAGCA
7801 TTTCAGTAAT TTCTCAAGAG TTGCAAAGGC CATAAATAGT AGAATCATGA
7851 TTTACAAAAC CCCTGTTTCC AAAGATGGGT ATTAAATGGT CCTAACAATT
7901 GTGAAGCCTC ATGTGGGAGT CAGAAGTAGA GGCACACAAG CCAGATGGGG
7951 AAAGGGAGGG CAAAGAAAAG CAAGAGAAGG GAAGGAAGAG GAGGGATCAT
8001 AAGGTTGAAC TTCAAATATC ATACACAAGT TTCGAAAGTG TTCCTCTTAT
8051 AAGGAAGTAA AATGTACATA TGCAGAAAAA CAAAAAGCTA CAATAGCCTA
8101 CATATAATTG GATAAATAAT GAAATACACA TTGAATCTAA GTAAACAGCA
8151 TAGAATCTGG GTGTAAAAAA GAAGTGAGCA AGTGCTCTGA GTTTTAAACT
8201 TAAACTTGCA AGTATTTATA AAAGCCCCTG TTTTATTTTG CAGTTTTGAT 
8251 GAAATGGCCA AATATGATCT CCCAGGAGTA ATAGACTTCA TTGTAAATAA 
8301 AACTGGTCAG GAGAAATTGT ATTTCATTGG ACATTCACTT GGCACTACAA 
8351 TAGGTATGTT TATGAGGGTC ACTGTTAGGT GTGTTTTTGA GGGTCAGTTT 
8401 TCTCAGAGTC TTACAGGAGT TCACCTTTAT GTTGGAATAA AACAACTGTT 
8451 ACTTATAGTG CCCTCAATTC CCTGTCCTCT GCTGGGAATA ACCCTAGTAC 
8501 TCTAAGTAGC TGTGAGCCTG CAGTGCACAG ACTATATGTA GGGCAAACCT 
8551 TTCCTGGGTC TCTGGTCACA GCAGCATATT GACTACGGTG ATGCAATTTC 
8601 CCAGGAATAA CATGTGTTCC AAATTCAAAG AAATAATTCC ACAGAGTAAG 
8651 TTTCTAGATT CCCTCTGAGC TGAAAAAGTA AAATTCAATG CCATGGAATA 
8701 TGGCTGAAAC ATAATAAATG TGCATCAATC ATCTCTTTCT CACAACCCAA 
8751 ATGGGATTTT TAAAAAATAA AAGGGAAGGG CTTATACCTA TATTTAAACA 
8801 AATTGAAAAG GCATGGTTAT ATTTGTTTGT GAGTTGGAAC ACACAAGCTT 
8851 ACTATAATAA ATCAATTGAG CTTATCTATT CAGTGTGTGA TTTAGTATTT 
8901 ATGAAATAGC AAGTAAATGT AAGCACTATG TAGAAATTTC TAAA G I I I I I 
8951 TAAGCTGACA ACTTACTTCT TAATTTACTT ACTTTACTTA ATTTACTTTA 
9001 CAATTTACTT TCCAGGTATT TTGGAAAGAA ATCAATAATC TAGTTCCAAG
9051 TAAAAGTTGA AAGGAACCCA CACTAATAAA AGCTTTGAAT TTGTCATTGA 
9101 ACTTCCACTA AAGTTTCCAA TTTTAAGAGA ATAAATCATG TGAAAGTGCA 
9151 ATATTTCAGT TTAGGGAAAT ATTTTCATTA TCACCACTAT CATCAGTAAC 
9201 AAACATATAT TCATTAGTAT TTTAGATTGA CAGGCACTTT CCAAGCTCAG 
9251 AACAGGCAGT TAGCATCAGT CAGCATATAC TAAAAAAGTA TCAAAGAACT 
9301 CATAGGAGAT CAAAAATGCC ACCAATAGGC AAATAATTAC AGTATCTAAC 
9351 ACTTATTGAG CATTCGTTAT GTGTAGGGTC TTGTGTTCAG GACCTTCCCC 
9401 ACAGTATCTC CCTCTGATCT TCAAAACAAC CCGAATGTTA TTATCCCCAT 
9451 CTCATAGAAG AAGAAACACA AGTTCAGAAC ACAGATTCAA ACCAGATGTA 
9501 TCTGATTTCA CCAATAGGGT GTGTAAGGAT TCCGGAGAAA TGGTGTAGAG 
9551 AAGAAGAAAT GACTTTAGTT GGI I I IGGAA AGTGGGTAGG ACTTAGATAT 
9601 GCTCTTATAC TTGATCTGCA AAAAAAAAAA AAAAAACCAT GGAGAATTTG 
9651 ATTATCTGTG CTCTGTGTTT CATTTAGGAC ATAAATATTT TTAGTGACTG 
9701 TTGTTTGCAT TTTGGACAGA GCAATTTCTG TTATGTAAGG AGCACCCACT 
9751 CTTTGTAGGA CATTTAGTAG GTCCCAGCCC ATTAAACAGG GCTCTGCAGT 
9801 CAGCGTGACC CTCAAAAATC TCACCTCCAC ACATTTCCAA ACACCCTCTG
9851 GGGAAGTACT ATTCCTGATT CAGAGTCTTT TTATCAATTG TTCAGTCAAT
9901 TATTTCAGTT CI I C_ I I I I IC TGGCCAAGAC AGTTTTAATG TTCCAACAAG
9951 TGTTTCAGTA CACACATACA CACACACACA CACACACACA CACACACACA
10001 CACATGCTAG TGGAGGCCCA GGAAGGGACC TCTGGAAACC AAATTATATG
10051 GATATTCTCC CTAGCCTACC CAGTGTTGTG CTAATCTCCA TCCTCACAGA
10101 TATACAAAGG GGTGCAATGC TACTGCTGAA AGAGCAAAGC AAATGGAGAT
10151 GCCTGGTCCT TACTGGGCCA TCGTGGATGC TAGGGAAAGC CCCTTTCTTT
10201 TTGGAAACAG GGAAGAGTCT AGAGGGTTGA AAAACACCCA GTAAGACACT
10251 GGGAGCAGTG AAATTTCATT CCATAGTGAG AAAGAAAACC TGTTAGAATA
10301 ACTGGGTGAT GCTGCAGAAA GAAATCAATT CACCTCCTGT GACTGATTAT
10351 TTGCTTCTGG AAGCTCTGTG ATTCATTCTG GCATCTCAGA GTTAGGGATG
10401 AAATGAGAAT GTTGCCAGCA TTTACCCCAT GCTTGGGAAG TTTACACAGC 
10451 AGTAGCTACT CCAGCAGCTT AACCATCACC TTTCCCCTGC CAACTACTCC 
10501 ATTTCCCCCA ATCAAGTCAA ACTGTCCATA AATAGAATAA AATAAAATTG 
10551 GAGACTTGAG AGCAGAGAAG ACTGAAGGCA GATTATCTTT ATAGAATAAC 
10601 TCAGAAGACT TCCAATTCAT CCCCAGTATG ATCACGATAG AAGGAAAAAA 
10651 TGACTAAGCA GAGCCCCAAT 1 1 IGI IAGAA ACATTGCGTA AGTATTTATT 
10701 TTTACAAGAT TGTCTTATCT CCTGTTCTCT CAGGGTTTGT AGCCTTTTCC 
10751 ACCATGCCTG AACTGGCACA AAGAATCAAA ATGAATTTTG CCTTGGGTCC 
10801 TACGATCTCA TTCAAATATC CCACGGGCAT TTTTACCAGG TTTTTTCTAC
10851 TTCCAAATTC CATAATCAAG GTAGGCTCCT TTCAACAAAA TGTACCTGAG 
10901 GATCTCATTT TGGATCATAA ATCCTTATTA TTTTCAAATC TACTGTAAAG 
10951 TAAAAGTAGG AAATTTAGAT AAAATCTATA GAACTTAGAC TCTGTGGGTA 
11001 TGTGCTTGTG TATGTGTGTC CCTGCGTGTG CGCATGTCTG TGCCATAGTA 
11051 TCTGCAGGTT CTGTAATACA ATTTACTATA CAAGGTCATC AGCAGGCTGA 
11101 GTATATGTCA GAATTTCTAG CTGAACTGAG TGCTATATGA CAAGAAGGAT 
11151 UNCI IGI I TTCCCAAGTG I 1 1 I I IGI IC CATTTAGTCA GGTAGGTCAA 
11201 TGAATTCACA TTGCCCAAAT GAAAGACACT TCAAGTTACC CATAATCACT 
11251 GATGTGTCCA ATTTTGACAT TAGAAAAACC TGATTAATAT ATTCCTTCCA
11301 ATATGGAAAC TTGCCCTAAT AACTAAAGCT AAGATTCCAA AGCCTAAATG 
11351 TATTACAGCT CAAGTATTAA TTCAAATATT TATTGGTTAT TTTTCAGGAG 
11401 TTGAAAAAGT CATTTGGTTG CCAATTGTGG ATTTGGGATT TTATCTATTA 
11451 AAGGG1 1 1 1 I 1 1 1 1 1 1 I I IC TCTTTGCTTT TGTTTCTCTA CAAAGGTCAT 
11501 TGCCACAATG AACACAGCAT TTAATCAAAT TCCAGATTGG CCTTTGAACT 
11551 TGGGATGATG GATAAAATGG ATTTGGGCCA AAATTGAAGT CAAGGAGACC 
11601 AGTTAGAATA TCAAAATAAT TCATATATAA GAAAATGAGA CGTTGGTTTG 
11651 GGGTAGAGTG GTAGGAATGA AAAAAATTAT TTGTGAGCTA ACACAAGGAA 
11701 TAATTTCCAT AGGGCCTAAT AATAGTTAGG TCTGATAATA CTATGGTCTG 
11751 ATAATAGTTT TATTGTATTG TTTACTGAGA GCACAAATGA TGTAACTTCC 
11801 TTATTCAAGA GCTTTTCTAG TTTATTTAAA AATGTGTTGA CATCAGTTAG 
11851 GTTTTAATGT TTTCTATATT TGGACAGTGT GAGCAAACTA Al I IGI I AAA 
11901 TTAAATTCAG AGAGAGATAC ATCTATCTGT AAATACATAT ATGCGTTGTT 
11951 TGTGTTGCTC TTCCTACATA GGTCAGCTAT AAGGCAAATA ATGTTCCTGG 
12001 GTTATCTCAG TTTCACATTT CCCACTGTCA ATATTCCTGC TACTTTTAAG 
12051 TCCCATATCC TGCTCTTTTC TTCCGTCAGT TTCCCCCAGA AGCTCCAAGA
12101 CCCCACCAGG AATCCCCATC CAAGTTTACT TTCCCAACTC CTGGAAGTTT
12151 CAATTGTGCT GCCTTTGTGA CATTATCATA TCTTTTCTGT TCAATGGTTG
12201 CTTCTCTTTG GCTCACTGTT CTCTACTTTT CAGCCTGAGA GCTGGCTAAT
12251 CTGGGACAGT ACTCGAATGC AGTGTACACA TGGGTAACAT GGAAAACCCC
12301 GATTTTCCCT TATATTCAAG GTATTATTTG ACCTTAAGAA AAACTGTTTT
12351 ACATTTCATA CCAATTAATG AGAAAAAAAT ATTGGCAAGC ACTGACTGGG
12401 CAGAATACAG GGAAGCTTCA CTATGGAGAA GTGAATTTGG GATTGAGGGC
12451 CTTTATTGCA ATCTCCTTGT AAATAATATT TGATACTCTT CCTCATCTGG
12501 AGACACATTC CTAAGTAACT TTTCCTGAAT AATTTGGTCT CCTTGACTGA
12551 ATCAGTAAGT ACAAATAGAT CCCCAAGCAT GGCTCTTTCC TAGAATGAAA 
12601 GAAATGTCAA GAAGTCTGAA GATGATTCTT GAATTTTGGT I I 1 1 IGCTAT 
12651 TGCTATTTGG GCTTGTTGTC CI IGI IGI IG CTATTGAGTT GAGCTCCTTA 
12701 TATATTCTGG TTACTAATCC CTTGTAATAT GGATAGTCTG CAAATATTTT 
12751 ATCTCATTCA AAGATAATTA TTATTTACTT TCATAGGCTG TTTTTGGTAC
12801 CAAAGGTTTC TTTTTAGAAG ATAAGAAAAC GAAGATAGCT TCTACCAAAA
12851 TCTGCAACAA TAAGATACTC TGGTTGATAT GTAGCGAATT TATGTCCTTA 
12901 TGGGCTGGAT CCAACAAGAA AAATATGAAT CAGGTATGTA TGATAATTAT 
12951 AGGGCCATTT GATACCTTAA GAAATTCCAG CTTTCCTTTG ACTCATTTTG 
13001 ATATATCTAT TTACTGTATA AATTCATATG GTATTCCAAA CCCTTAAAGA 
13051 CAGAI I I I I I TTTGCTTTTA AAAATGTTTA TGGGTATATA ATAGTTGTAC 
13101 ATATTTATGA GACACATATA TTTTGATATA AGCATACAAT GTGTAATGAC 
13151 CAAATCAGGG TAATTGGGAT ATCCATCACC TCAAGCATTT ATCATTTCTT 
13201 I I IGI IAGAG ACATTCTAAT TTGACTCTTC TAGTTATTTT GAAATATACA 
13251 ATGAATTATT GTTAACTATA GTCATCCTAT TGTGCATGCC AGACTTTAGT 
13301 CCTTCTAACG GTATTTTGGT ACCCATTAAC CAATGCCTCT TTATCCTTCC 
13351 CCCACCCCTA CTACCTTTCC CAGCCTCTGG TAACCATCAT TCTTCTCACT 
13401 ATCTCTATAA GGTCAGI I I I I I I I lAAACT CCCCTATATG AGTGAGAACA 
13451 TGCAGTATTT GTCTTTTTGT GCCTGGCTTA TTTCACTTAA TGTAATGTTC 
13501 TCTAATTTCA TCCACATTAT TGCAAATGAC ATGATTTCAT TCTTCTTATG
13551 GCTGTCTATA TGTACCACAT TTTATTTATC CACTCATCTG TTGATGGACA 
13601 CTTAGGCTGA TTTCATATCT TGGTCATTGT GAATAGTGCT GTACTAAACA 
13651 TGGGGGTGCA GATGTCTCTT CCATGGATTG ATTTCCTTTT I I I 1 1 ICTGA 
13701 ATATAGACCT AGCACTGGAA TTGCTGGATC ATATGGTAAT TCTAGTTTA 
13751 (jI I I I I I GAG GATCCCTCAT ACTCTTCCCC ATAGTTCCTG TACTAATTTA 
13801 CATTCCTACC AACAGTCTGT GCAAGAGTTC TCTTTTCTCC ACATTCTTGT 
13851 CAGCATCCAT TATTGCCTAT LI I I I IGATA AAAGCTATTT TAACTGGAGT 
13901 GAGATAGTAC TTCATTGTAG TTTTAGTTCG CATTTCTCTA ATGATTAGTA 
13951 ATGTTGAACA TTGTTTTTAA TGTACCTCTT GGCTATTTGT ATGTCTTCTT 
14001 TTGAGAAATG TCTACTCAGA TCTTTTGTCC Al I 1 1 IAAAT CAGAI I 1 1 1 1 
14051 TTTTGCAATT GAGTTATATG ACCTCTTTAT ATATTCTGGT TACTAATCCC 
14101 TTGTCAGATG GGTAGTTTAC AAATATTTTC TCTCATTCAA CAGGTTCTTT 
14151 AGTTCACTTT GTTGATGGTC TCCTTTGCTT TGCAGAAGCT TTTTAGCTTG 
14201 ACGTAATCTA ATTTGTTCAT GTTTGCTTTG GTTGCCTGTG CATTTGAGGG
14251 CTTACCTCAA ATTGGCCCAG ACCAATGTCC CGGAGTGCTT CTGTAATGTT 
14301 TGI I I 1 1 I AG TAGTTTCATA GTTTTAGGTC TTAAATGTGT CTTTAATCCA
14351 TTTTGATTTT GTTTTTGTAT CTGGCAAGAG ATAGAGATCT AATTTCATTC
14401 TTCTGCATAT GGATATCTAG TTTTCCCAGC ATCAI I I LI I GTGGAAATTG
14451 TCCTTTGCCC AATGTATGTT CTTGATGCCT TTGTTGAAAA TTAGTTGACT
14501 ATAAATGTGT GGATTTATTT GTGGG 1 1 L I I TATTCTGTTC CATTGGTCTA
14551 TGTGTCTGTT TTTATGCCAG TATCATGCAG TTTTGATTAT TACAGGTTTG
14601 TAGTATAATT TGAAGTCAGG TCATGTGATG CCTCCAGCTT TGI ILI I I I I
14651 TCTCAGAATC TTATATTTAG AAAAACGTAA AGACTCCAAC AAAAAACCTG
14701 CTAGAACTGA TAAACAAATT CATTAAATTT GCAGGATACA ACATCAACAT 
14751 ACAAAATTCA GCAGCATTTC AATATGCCAA GAGCAAATAA TCTTAAAAAA 
14801 AAGAAAGAAA AAAAAACAAG AAATAATCCC ATTTATAATA GCTACAAATA 
14851 AAATAAAACA CCTAGGAATA AACCATACCA AAGAAGTGAA AGATTTCTAC 
14901 AATGAAAACT ATAAAACACT GATGAAAGAA ATTGAAAATG ACATTAAAAA 
14951 ATGGAAAGGT ATTCCATGTT CATGGATTGC AAGAATCAAT ATTGTTAAAA 
15001 TGTCCATATG ATCCAAAACA ATCTACAGAT TCAATGCAAT CCCTATCAAA 
15051 ATACCAATGA CATTTTTCAT TGAAATAAAA AAAAAGCCTA AAATTTAAGT
15101 GGAACCATGA AGGTAGATGT CTGCTATACA TAGAAGATTA AGTACTCAAC 
15151 AAACCTTGAA TATGAAGACT GGGGAAGTGA ATAGGCAGCT TCACTCTTCT 
15201 ATTCCCTGGT GAAATTTAGG AGAATGGATG TTTTATAATG GGTAGCAGTT 
15251 TCTTACATGT TCTCAATCAG CCATAACTTA CTACAGTCAA TTTGAATTTA 
15301 TTGCATTTGA ATATATTGGA TTAAAAATAA AATCCTAAAA AAGGAGAGAA 
15351 GCACATATAA ACCTGCGTCT TATTTCATGT GTTCCTTTCT TTGTGGGTGA 
15401 LI I I IGI I I I GAAATAAAAC CTGCAAAATA ACAGGACAGG GTGGAAGGGA 
15451 GATGGGATCC CCTCTTTATG AAGAAGCAGC AGTCCTGTTT TATCACCTCT 
15501 TCATTTTCTG TTATTGAGAA TTCAAGAAGA AGGAGGAGGA AGAGTTCACA 
15551 TCCACAGACT GGTGTGGTTG AATAGTTGTC TCTACTGTAT TCCAAATAGC 
15601 AGCCAATGAG GCTGTTACAG TGAAGCCAGT CCCAAGATAA TTGTTCTGTA 
15651 CCCCTATTCT CTAAGAAGCT AAATTGTGTT AGACTGAAAC CCATAAGGAA 
15701 CCATTGTTCA AAGTTGGCTT GTTCAAAAGT AAAGAI I I I I AATAGTTTCT 
15751 GTAATTAGA TTATTTTCTA AGACATAGAA TTATGATTAC TATTTTATCT
15801 CTATAATTTT CATCTCTATA ACGTTTACAA ATACTGAAAT AACCnTGGA 
15851 AAAAATTGGC TTTTAGCTTT ACTTTTGCAA TATTTTATTT TATCCCCATA 
15901 AAAGCCTAGG AAATTGGTAC TATGACTTTT AGTATGTTCA TTTAATAGAT
15951 GAAAACACAG AAACTCAAAG ATGTTAAATA TGGTGGCCAA GTTCACAAAG 
16001 CTGATCATTA ACAACAACAG GGCCTGAACT CCTGGTTTTC TGATTTAATC 
16051 TGTGACAGTG CACCTGGGTG CGCATGCATG CATCACCCCC ACACTTGCAC 
16101 ATAGAACCTT TCCTAGTTGG CTTTGCTCCA TGATGACCAT TACTGTTCCT 
16151 TCTACTTCAA AATAAGCAAA TTATCCTACA GATTCAGAGC TGGTACAGGT 
16201 GTGCTGTCAA GCAGCCCATT CCATTAGTCA GCTTGTGGTT CACTCACATT 
16251 AAAGTATTGA CCTAAATGGT ATATTTATCT AGATAATTCT ACL I IGI I AT 
16301 TTTCAAAGCC CCAGTCTTGT TTGCTAATTC TGTGCATCAT TTTTCTCTGA 
16351 TTCTGAAAGG CAAAATTTTG TTGGGCAATT GCTGTAATAT GAGTTTTATC 
16401 TCCTTTAGAG TCGAATGGAT GTGTATATGT CACATGCTCC CACTGGTTCA 
16451 TCAGTACACA ACATTCTGCA TATAAAACAG GTAGAGTCTT AGTCATGGAA 
16501 AACCATTCCA ATCCTTATTT TCAATATATT TAAAAAGACA GAATTGACCC 
16551 TGTTAACAGG CCTACCCTAA GAATCTTAAG AGCTTGCTTC CAGTTTGTCC
16601 TTGCTGCCTT CTGTATGCCT TGATTTCCCT GGAATTTAAG AGAAAGGATG
16651 TTATGGTACA GACCAAGTAG ATGACATAAA TGAACACCAC CTTAAATCAG
16701 AGTTTTAAAA ATAGGCCCTG AACTGAAGCA AGAGGTAAAC TAGGGAAGCC
16751 TCAGGAGAAC TGAGACTTCT CCAGAGAGAA GTATCTGGGA TTTAACTTCT
16801 TTCTAATGAG GCTTGGTTTT CCATGAACTT TTCCTTTAAA CCAAGGGGGG
16851 TATTGCTCAT CTTTCTGTTG AGCCCCATTT GTCATAATTG TAAAATGGGT
16901 GGTTACATCC TTCTGGTGAT CTAGGAGCCC TATTTTCGTC CTAGCATACA 
16951 GCATTTTTCT AAAATTTGCT GTTAGCTTTC ATGATTCTTA CCCTAACTAT 
17001 TCI I I I I OA AAAAACATTT GTTTCAGCTT TACCACTCTG ATGAATTCAG 
17051 AGCTTATGAC TGGGGAAATG ACGCTGATAA TATGAAACAT TACAATCAGG 
17101 TGAGCTATTT ACAGTAACCC CAGCATGCTG ATTTTGATAA ATTATAATAA 
17151 AAAATTATTT GAGGGTGGAA AGACTCCTAC CTGTCATTTG GTGGCATTTA 
17201 TACTGATAGA ALI I I I I I I I AAAAAAATTT TAATTTTAAT TTTAATTTAT 
17251 TTCAGAAAAT TTATAAATTA AAGAAGCATA TACAAAGAAA CTTACATCAT 
17301 GTGTAATCCT TCCATCCAGA GATAACTAGA TGTACTAACA TTTTGGTGTA 
17351 TTTATTCCAA TTTTCTCAGT ATTATATTGC TTTTAGACAA CTTTTAATCT 
17401 TTCTATTTTA CTTAAGCTAT AGTAAGAGAT AACTAATATA ACTGAGGGAT 
17451 TTTTAAATGC Al I I I IAATG GCTACATAAT AGAAATTATT TCATAAAAAT 
17501 CTTTACAGCA TAAATGAATA TACALI I I I I AATACCAACA GAAAAATTAG 
17551 AATTCCATAT GAAAGTTGAA TAAGTATTAC CCAACATTGA AGACTTGGGT 
17601 CGTAAGGCAT CTTTCTCCAT ATAGCTTTAT GACATAAAAA TCTGTAGCCT 
17651 TGTTTAGCAC CGTACTTTTA ATTAATCCTG TCACCATTTT TCTGTTCTCA 
17701 TAGCCAGGGG CTTGGCTTAT AAGTATGAAC TAAGCAAACT AAATTAAATT 
17751 GTTTTAAGTA TTTTCCCAGG CTATCATATT TTAAGCTATT TACTGGTGCA 
17801 ACTATAGATT ATTAATAAGT TGTTTCTGAG GATCAAAACA ATCAGACTAA 
17851 TCAATTTCTC AATAATGAAT TGGCCTGTTA GAGGAATAAT TCTACTAATC 
17901 CTTAAAACCA CTACAAGAGA TAGACCATGT ATATTTTATT T A I 1 1 I I A AA 
17951 AATAAGTTTA AGATGTGATT TACATACAAG AACATTACTA ATTTTGTGTG 
18001 TCCCATTTAA TAAGTTTTGA CAAATATATT TATTTGTGTA ACCACACCAC
18051 AATCTAAATA TAGGACGTTT ATATCACCAC TAAAAGTTTT TTTCCTGCTC 
18101 CTGAGACTAT TTATAGACAC AAATGCGTGT ATTTGCAAAT GCTTAGAAAA
18151 GGTCTAGAAA AAAAAACAGT AAATGTTAAA GTGGTTATCT TCAGAGAGAA
18201 GAAAGAAGAA AAGAAGTGGA TGGACATGAA ACAGTAAAGG ACCCTCATTT 
18251 TGGACTTTAC ATATGTCTGT TTTCTTCCAT TATTTTGAAT AAACATGCTA 
18301 TATTTATAAA TTATTTACAT TTACAAGAAA ATGAAACAAA ATCAACACGC 
18351 ACATTCAAGA TCATTATGGT CAAGTACTAA AGTATGTGAG AGTGTTAATG 
18401 TCCTTAGAAT TTGGCCACAG TTAGCTGGTC CTACTCTGCT CCAAGCCGGT 
18451 CCTATTTTGT GAATTAATCT CATTTGATGC CAAI I I I IAT TACATTCTCT 
18501 CCAAAAAACT AGTCTCAACA GTTTGCTCTC TCCTCAAGTT CACAGCATTA 
18551 TCTCTGCTAT ATCTATATTT TATTGAGTAT AAGAGAATTA ACCCATGTAA 
18601 GCTCCATGAG GGTAGGGATT TCTCATCGTT TTGTTCACCA GTGTTTTCTC 
18651 ATCTTGAAGA GTACATGACA ATTACTGGGC TCCCAGTATC TATGTGTTGC 
18701 ATTAATGAAA TTTCTTAACT TTAATCTACC TCAAAATGTC TCTATCTTCT 
18751 TGATTCTCTC CTTCCTTTCT CTATCAGAAA ATGATGGTCC TCTTATTTTC 
18801 CAAGTTATTC CGGTCCTGTG CCCTTGATCG CATCTCTTCT CACTTCCCCT 
18851 TCCTTCCTGC CTCCATTCTC CTGTCCCTTA TGAAAAACAA GCAAGACCAT
18901 CAATTCTATC AAGTTATCAT TATGTCACTC TGTTCTTATC AACATATTTT
18951 TAGTATTGAA GAGGGCTTCT TCTACTTACT CCTGAACCTT GTACAATGTA
19001 GTTTAGGTCT TCATCI I I I I ATCATAGCTA CCTTATTTAA AGTCACCCAT
19051 GGCTTTTAAT TGCCAAATTC AATGGCCTAT CTTCACCTTT TGAAATGTGT 
19101 TATGTTCGTT ACCACAGTCT CCTTGAAACT CAGTCCCCTG ACTTGGACTT 
19151 CCATAACACA ATGATTTCTG ATTTTCCTTC TGTTTGTGAT TGTTCCTTTT 
19201 GTCCCAGGCA CTGGCTACTC CACCTTCCAC CTCTCTGAAA TCATTAGCAT 
19251 TCCCCAAGGA TTCTTCAAAA CTCTCTTTCT TCCTTGGAGA AGTCAGCATA 
19301 GCTTTAATTT GGACCATTTC TATGGCTTAT CTAGAI I I I I TCAGGACTTG 
19351 CCTTCAACCT ATTCTTTCTG TAGGTGATTC CATTAACTGT TGCCCATATG 
19401 GTAGTCCGAA GACAGACCTC CGAGAAATGA CCCTTGTCTC CAAAACTTCC 
19451 GCAATATGTC CAAATTTCCT AGCCTGACAT TCAGACTTTG ATTATCTGCC 
19501 TCCAAGTTTA TATCCTATCA TATTCCTTTA TATATTCTGT TCTCCAGGTA 
19551 CACTGGGAAG CTTGCCATTC CTGATCATAG CCTACAAACT CTTCCTGCCT 
19601 CCCACTCACC CTCATCTCTG CTGTCAAAAT GCAACCTTCC CTCAAGAGTC 
19651 ATTTCACAGG ACCCCTCTTT CTATGAAGCC CTCAGGTGGA AATAATTTTT 
19701 TGCCI I I I I I TCCATTTTAT TTTTGGAGTG TTTATGGCAT TTAACATACC 
19751 TTACTTTGTA TACAAATATT TGCCTTGCTC CCTCTTTTGC AAA I I I (J IA 
19801 AAGGTAGAGA CCATTGTATG I I I I CI I CAT ATGTTGCTGG TGCCTAACAG 
19851 AACTATGGCC ATTGTCCACA TTCATTTAGC AGCCTTTGTA GTTATTGCTT 
19901 TGAGGAGCTT CCTCTCATGA ATGCCCTTGC TTTCTCTCCC ACAGAGTCAT 
19951 CCCCCTATAT ATGACCTGAC TGCCATGAAA GTGCCTACTG CTATTTGGGC 
20001 TGGTGGACAT GATGTCCTCG TAACACCCCA GGATGTGGCC AGGATACTCC 
20051 CTCAAATCAA GAGTCTTCAT TACTTTAAGC TATTGCCAGA TTGGAACCAC 
20101 TTTGATTTTG TCTGGGGCCT CGATGCCCCT CAACGGATGT ACAGTGAAAT 
20151 CATAGCTTTA ATGAAGGCAT ATTCCTAAAT GCAATGCATT TACTTTTCAA 
20201 TTAAAAGTTG CTTCCAAGCC CATAAGGGAC TTTAGAAAAA ATGGTAACCA 
20251 ACAATGAGGT TGTCCCCCAG CACCCTGGGG GAGATGCACA GTGGAGTCTG 
20301 TTTTCCAAGT CAATTGTGTT AGTGTTATTT ATGTTTAGAG ACATCTTTGC 
20351 ATGGGACCAT CTACAGGTCC TTATAAACAA TGAGGTAGAT TAGGCAAAAA 
20401 GATAAACAAG TTGCTACTCT ATCTGGCATT TAAGTCTAAT TAAATTGTAA 
20451 TTTTTAGGGC ATACCATGAA GTATAGAAAT GTCTGAAGCT TCAAAGGAAC 
20501 AGTGAAATTC CTTTAAGGTC CTATATGGAA ACCTCTGTTG TCATTTTATT 
20551 TATATGGATT GCTATGGCAA TGGACAGAGT GTGGGATTAG GAGGAGGGCC 
20601 TGTAACTTCT TTATAAAAGT TTCTTAGCTA TCCTGAAGAT GTATAGACAT 
20651 TTTTACTTTT TTAGGTATTT TCAACATCAG AAATTCAAAA AAGTCCCCAA 
20701 AGATTCTTCC AGAGAAGCCC TCTTTTCTTA CAATCTTATC CCTGGCTATC 
20751 TGCGTAAACG GAATCTTGAA CCCATAATAG GATACATGTA TAAAATCTTC 
20801 CTTATTAAAG CAGAAATAAA TTGTACAGCA TCAATATCAT TTTATAATCA 
20851 TAGGGAGGCT TCTTTGTTTA GCATGTAATG CCCCCTTTAC AGGCTTTTTG 
20901 TTCTTTGAGG GGTTTGAACA TTCCATGAAA AACTGACAGA TAGGAAACTG 
20951 ACAATAAAAG ATTGAGCTAA AGATGGAAGC AGAAAGTACT AGGCTAGATA 
21001 GTCTCTAAAC ATTAAGTATT TTCTTCCTCC ATCTTAAAAG CAATGAGAAG 
21051 CCACCAAAAT ATTTTACCTA ATGGAAACCT GATTGCCGCA TTTTTGTAAC 
21101 CACCACTTTG GCTGCTACAT AGAGAATGGA TTAGAAGATG CCAACAAAAG 
21151 ATTCTGAGCA AGTCTGTAAA TCTGATCAAG TGTTCTGATG CAGGCTGATA 
21201 TCCTTCTGTG CTAAGAGAGA TGATCCTTGG AAAATCCAGA GCCAGCTCCA 
21251 TAATACTTTC CTGCTCTGCT GGCAAATCCA CAAGCTGCTG GCCCCTGGAG 
21301 CCATTCTTCT CTCAAAACTA GCATTCATCA ATTTAATGTA TACGTATTGA 
21351 TGGGGAATAA TGGTCACTAT GAAAACCATG TGATAATATG GAAAAATACC 
21401 CATGATATAA TGTTATGTGA AGAGAAGAAA ATGAAACTGG TAGAACTATG 
21451 TGATTGCAAA TATATACAAA TATTAAAACA ATTATATGAC TTTATAAAAT 
21501 ATTTGTATAT AATGAAAACT GAAGCAATAT AAAAAATAAA ATTAGTTGTG 
21551 TCAGGGTAGT AACATGATGA GTGATTAATA GTTTTTAATT TTTAATATAG 
21601 TAATGACATA ATGTTACAAC TTGTCCAAAT CTCACAAACA TAATATTCAG 
21651 TAAAGGAAGA TAAACATAAA AGAATACATA TTTTATTATA CATTTTTATG 
21701 TAGGCTAATT GATGGTTCTG AAAGCCTTAA AAAGCTTACT TTTAGGAGGA 
21751 GAATCATGCC TTGGAGGACT CTAGGGTCCA GAAAAATGTC CTAATACTAG 
21801 AGCTAGGTGC AGTCAGATTA ATTATAATAC ATTTCATTAT TTTGTCTGGA 
21851 ATACCAAGAT GACTTCCAAG CAGGAATGGA GTCTAGCAAC ACTTTACTGA 
21901 TGGGGAACTT GGCCACAGAC TTGTAATACA AATTTTTGGA TATGTTGACA 
21951 ATGTTTCTCC TTATTTTTCT TACTTATACA AAGCAAGAAA TTTGGCTCAC 
22001 AACCTTGAAA CAGACTTACC AGGTTCCTCC AGTTTCCCAA GCCTCAATAT 
22051 CTCATTGCTA TTTTTAA 
(SEQ ID NO: 3) 

SNPs: 

DNA 
Position    Major    Minor 
165         G        A 

Context: 

DNA 
Position 

165 

TTATGGCCTAACU I I 1 1 MCTTTGAGTTATTTTCMGAGAAMTTTGAAAAAGCAGCCT 
TTGAGGAGAMGMGCMTCCMCAMCAAAMGATMCCACACTGTMTA^ 
TTTTGAATAGGACATTGGAAGAAAAATAATAATCA I I I I IACAG 
[G,A] 

TAGATCCCAMGTCAAGGATCTATGTTCAACCATGTGTGTTCCACCAT 
ATGAGTAACCATCATTAAGCAGTTAGCTTAGGCCGTMTATGATT 
CAAAMTACCACAGGCCTTCTGAMGGTTACCCCTTTCTAGCTCCACT^ 
TATTAAAAAAAAAAAAAAAGGAAAMTTTGAGCTTCTAGAGAGTAGGGGCTAC^ 
TATCCCACAGGGCCAAGGAACAAGTTTT AATGTATTCATTTAMTTMTTTCAGTATGAG 



226 TTATGGCCTAACU I I I IMCTTTGAGTTATTTTCMGAGAAMTTTGAAAAAGCAGCCT 

TTGAGGAGAAAGMGCMTCCMCAMCAAAMGATAACCACACTGW 
TTTTGMTAGGACATTGGMGAAAAATAATAATCAI I I I I ACAGGTAGATCCCAAAGTCA 
AGGATCTATGTTCMCCATGTGTGTTCCACCATCTTCACAATTGA 
[A,G] 

TGAGTMCCATCATTMGCAGTTAGCTTAGGCCGTMTATGAT^ 

AAAMTACCACAGGCCTTCTGAMGGTTACCCCTTTCTAGCT 

ATTAAAAAAAAAAAAAAAGGAAAMTTTGAGCTTCTAGAGAGTAGGGG 

ATCCCACAGGGCCAAGGMCMGTTTTMT^ 

ATTGAMTATATMTAGAMTATTGTMCATTATA^ 

TTATGGCCTAACC I I I I I MCTTTGAGTTATTTTCAAGAGAAAATTTGAAAMGCAGCCT 

TTGAGGAGAAAGMGCAATCCMGW\CAAAMGATMCCAC^CT 

TTTTGMTAGGACATTGGAAGAAAAATAATAATCA 1 1 I 1 1 ACAGGTAGATCCCAAACTCA 

AGGATCTATGTTCMC(^TCTGTGrrCC^CCATCrrCACMTTGMTGAG 

[T,C] 

MCCATCATTAAGCAGTTAGCTTAGGCCGTAATATGATTCTrGGACTG^ 
TACCACAGGCCTTCTGAMGGTTACCCCTTTCTAGCTCCACT 
AAAAAAAAAAAAAGGAAAMTTTGAGCTTCTAGAGAGTAGGGGCT^ 
ACAGGGCCAAGGAACAAG I I I I MTGTATTCATTTAAATTAATTTCAGTATGAGTATTGA 
MTATATMTAGAAATATTGTMCATTATATATTTTCTATATALI I I I ATTATATAGAAA 

CTTTGAGGAGAMGAAGCAATCCMCAMCA^ 

TGTTTTGAATAGGACATTGGAAGAAAAATAATAATCA I I 1 1 I ACAGGTAGATCCCAAAGT 

CMGGATCTATGTTCAACCATGTGTGTTCCACCATCTT 

ATTMG(^GTTAGCTTAGGCCGTMTATGATTCTTGGACT 

GGCCTTCTGAAAGGTTACCCCTTTCT AGCTCCACTATCATCTAATTTTATTAAAAAAAAA 
[A,-] 

AAAMGGAAAMTTTGAGCTTCTAGAGAGTAGGGGCTACCATTT^ 

AAGGAACAAGI I I I AATGT ATTCATTTAMTTMTTTCAGTATGAGTATTGAMTATATA 

ATAGAMTATTGTMCATTATATATTTTCTATATACTm 

TACAGMTATATTATTAMTATTGTAGMCMTATATMTACAGAAAAATATATAATACT 
CAGTMTATATTAMTACTTATTAAMTAGCMG 

GCAGTTAGCrrAGGCCGTMTATGATTCTTGGACTGAGATTTCAA^ 
TCTGAMGGTTACCCCTTTCrAGCTCCACTATCAT^ 
AGGAAAAATTTGAGClTCrAGAGAGTAGGGGCTACCATTTTGTA 
MCMGTTTTMTGTATTGVTTTAMTTMTTTCACT 

AMTATTGTMCATTATATATTTrCTATATACTTTTATTATATAGAAMTATATA 
[G,T] 

MTATATTATTAMTATTGTAGAACAATATATMTACAGAAAA^^ 
ATATATTAMTACTTATTAAAATAGCAAGCTT ATATAGGMGAGTGATGGAGCATTGTGA 
GAAAGTTTCAGCTTTA I I ICI I I GACATTAL I I I C 1 I I CTGCACAAACAAAAGAATTACA 
GGMTTGTCCAGATTATTCAMTMCTCGMGTTGAGGAGGGMTATMCT 
AGAMCTCTTTTMGATTTGAGCTAGCCTACAATCTGTAM 

AGGCCTTCTGAMGGTTACCCCTTTCrAGCTCCACTATCATCT 

AAAAAMGGAAAAATTTGAGCTTCTAGAGACTAGGGGCT^ 

CCMGGMCMGTTTTAATGTATTCATTTAAA^ 
TAATAGAAATATTGTAACATTATATATTTTCTATATACTTTTATTATATAGAAAATATAT 
ATrACAGMTATATTATTAMTATTCTAGMCMTATATMTAGVGAAAAATATATAATA 
[C,T] 

T(^GTMTATATTAAATACTTATTAAMTAG<^GCn"ATATAGGMGAGTGATGGAG(^ 
TTGTGAGAAAGTTTCAGCTTTAI I ICI I I GACATTAL 1 1 IGI 1 1 CTGCACAAACAAAAGA 
ATTACAGGAATTGT CCAGATTATTCAMTMCTCGAAGTTGAGGAGGGAATATAAGTCAA 
TGATGTAGAMCTCTTTTMGATTTGAGCrAGCCTACMTCrGrAAAGATCT 
GMCTATATTTGTGCTATTTCCATATTMGTCMGGCMCAM 

1621 CGGCTTMGCTCCACAGGCATACAMCTG 

TATCTGGTATCTCATGTGGGGCTTAGAGGTAMTTGTCGTTATn"GGCCT 

CTTTMCCACXGGTGTAMCAMGGTTAC^^ 

TTGGCATGrGMTTAGTTTCCrCTGCCATACTGCTAGrrCO 

GATTTAGGAGTC^GGGrTGCCTCATCTTCTCAMTGAGTTACAGT 

[A,G] 

CACTGCATGGTTGGCACTAGTTCCTTGATATATGTTACT^ 
TCAMTGGGGAAGGGAGATACTATTGTCrCTGATTGTCCATTMGATCTTG^ 
TACXTCCCTGTTTGACACACTGG 

TACTCAGTGGAMCATGMGGATTCCGTCAMCTGGTTATTT^ 
ATTTCCCMCCTGCMGTGCATCATGGCCriTGGTGTGTCAQ 

2330 AAMGTTCAGMGTTCCTCATCAATMGGAGTCCTTGTGAGGVGGTGM 
TAGGTMGATGMGATCTATCATMCGVGGAGGCAGGTTGGM 

CAGTCAGGTGCAAGAGCrcrGC^GTGAGGCTGCCTGAGTGTCCATCCTA 
TCrrGGCrCTGTGACCrTGAG^GGTClTAMTCTCTCTAAGCLI I IGI I 1 1 1 I I AATTG 

ATAAAATGAGGATMTMTAGTACCAAMTTAGGGAGATTTTCAGAGCIT 
[C,T] 

GTGMCTATTTAGAGTMTGCCTGC(^TMGGGGACTCAGTAGOT 
AGV\TTTGAAMGTTTCATMTATTTGCAGATATAAGATGATCTTO 
TGTATGeAAAGCTATTTAGCTraGM 
TITGTTATAGTGMTTATCTGTMTATTTATCrCTTOT 
GAMGOVTTTATTMGAACTTACACTGCACTAMTGTTATATATGAC^ 

2498 AGATCTCTCACCTCrrGGCTCTGTGACClTGAGCAGOT 

I I I I I I I MTTGATAAMTGAGGATMTMTAGTACCAAMTTAGGGAGATTTTCAGAGC 
TTAMTMCATACGTGMCTATTTAGAGTMTGCCTGCCATMG 
TTATTAGTTTCATAOVATTTGAAMGTTTCATMTAT^ 
ACCAGATAGCTMTGTATGCAMGCrATTTAGCTTCAGMGTAMCTCT 
[A,G] 

GTTAAATATTAL I ITG1 I ATAGTGMTTATCTGTMTATTTATCTOTGCTC^CTTTTAT 
MGAAAMTAGTGAMGCATTTATTMGMCTTACACTGG^ 
TMTCCTCACTATMCCCTATGAGATAGGTTACATTATTGTCC^ 
AMCCMGAGACAMGCTACTAAMCACTTGCCrGAGGTTAGACATL I ICI ICTGTGGTG 
AGGCTGGATTTOWVTTTAGACCAT^ 
TTCTAGAAGTTAAATATTALI I IGI I ATAGTGMTTATCTGrMTATTTATCTCTTGCTC 

ACTTTTATMGAAAMTAGTGAMGCATTTATTMGMCrTACACT 

ATATGACrTMTCCTCACTATMCCCrATGAGATAGGTTACATTATTGTCCT 

TMCMGGAMCCMGAGACAMGCTACrAAMCACrTGCCrGAGGTT AGACATL 1 1 CI I 

CTGTGGTGAGGCTGGATTTCAMTTTAGACCATTTGACTGTAGCA 

[T,C] 

GCrGTTTAGTGrrATAGTGTTGGrCrACCTTTGMTAGACA 
GGMGTGAGACTGCACATTGAMTATGTAAMTTTGCCrrTGG 
GTCACATCACTAGAAACTAATCATAAGCI 1 1 1 GTGTTTGGTTAAAG 1 1 I I ATTGATCCAT 
1 1 I ILI IGI I 1 ACnTGTGGGATACTGGGCTT AACTAGGGGATACCTCCAL I I 1 1 IACTT 
GGCCATGGTATGAAMCCTGTCCTCTGMTCTTTAGATATTTTGGC^ AGGCAAA 

ATTTATTMGMCTTACACTGCACTAM^ 

TATGAGATAGGTTACATTATTGTCCTMTTTTACTMCMGGAMCCM 
ACTAAAACACTTGCCTGAGGTTAGACATL I ILI I CTGTGGTGAGGCTGGATTTCAAATTT 
AGACCATTTGACTGTAGCACTTATATGATGAGCATGCrGTTTAGTGTTATACT 
TACCTTTGAATAGACATACI I 1 1 AAACCATGGCAAGGAAGTGAGACTGCACATTGAAATA 
[T,C] 

GTAAMTTTGCOTTGGGTGCCACGTGAGAMTAGTCACATCACTAGAM 
GCTTTTGTGTTTGGTTAMGTTTTATTGATCCA 1 1 I I ILI IGI I I ACTTTGTGGGATACT 
GGGCTTAACTAGGGGATACCTCCAL I I I I I ACTTGGCCATGGT ATGAAAACCTGTCCTCT 
GMTCrTTAGATATTTTGGCAMTTGTAGGCAAACAMGACT^ 
ATTAAMTMGACCAAAMTGCCTCCATACTTGATTAMTTTATTTCA 

TTATTMGMCTTACACTGCACTAAATGTTATATATGACTTAATCCTC^ 
TGAGATAGGTTACATTATTGTCaVVATTTTACTMC^ 

TAAAACACTTGCCTGAGGTTAGACATL I ILI I CTGTGGTGAGGCTGGATTTCAAATTTAG 
ACCATTTGACTGTAGCACrrATATGATGAGCATG 

CCTTTGAATAGACATAL I I I I AMCCATGGCMGGAAGTGAGACTGCACATTGAAATATG 
[T,C] 

AAMTTTGCCTTTGGGTGCCACGTGAGAMTAGTCACATCACTAGAM 
TTTTGTGTTTGGTTAMGTTTTATTGATCCA I I I I ILI IGI I I ACTTTGTGGGATACTGG 
GCTTAACTAGGGGATACCTCCAL I 1 1 I I ACTTGGCCATGGTATGAAAACCTGTCCTCTGA 
ATCTTTAGATATTTTGGCAAATTGTAGGCAMCAMGACTTAM 
TAAMTMGACCAAAAATGCCTCCATACT 

TATGACTTMTCCTCACTATMCCCT 

MCMGGAMCCAAGAGACAMGCTACTAAMCACTTGCCT I ILI IC 

TGTGGTGAQGCTGGATTTCAMTTTAGACCATTTGAC^ 

GCTGTTTAGTGTTATAGTGTTGGTCTACCTTrGMTAGACATACTTTTAMC 

GGMGTGAGACTGCAGA7TGAMTATGTAAM 

[A,G] 

TCACATC^CTAGAMCTMTCATMGCTTTTGTGTTTG^ 

I I ILI IGI I I ACTTTGTGGGATACTGGGLTTAACTAGGGGATACCTCCAL I I I I IACTTG 
GCC^TGGTATGAAMCCTGTCCTCTGMTCTTTAGATATTTTGGCAMTTGT AGGCAAAC 
AMGACTTAA^GCMTTCAACCTTGATTAAMTMGACCAAAM 
TAMTTTATTTCATTTTAGGMCTGGATTAT^ 

3076 CTTATATGATGAGCATGCTGTTTAGTGT^^ 

TTTTAMCCATGGCMGGMGTGAGACTGCACATTGAMTATCT^ 
TGCCACGTGAGAMTAGTCAGAT(^CTAG 

AGTTTTATTGATCCA I 1 1 I IU IGI I I ACTTTGTGGGATACTGGGCTTAACTAGGGGATA 
CCTCCALI I I 1 1 ACTTGGCG\TGGTATGAAMCCTGTCCTCTGMTC1TTAGATATTTTG 
[G,T] 

CAMTTGTAGGCAMCAMGACTTAMGCMTTCAACCTTGATTA 
TGCCTCCATACrrGATTAMTTTATTTCATTTTAGGAACT^ 

TCTACATGAAAAMTAGATTMTAGTGCTCCMGTTAGTTCACTCT I I I I I 

ATACATTATCTGCCTT CGGTGTTATTCMGTrTTTCATTMTCATTMTMTTTCACTAAT 
CATTTTATTTCATTMTCAACATTGATAGTTAAMTTMT I I I I 

3745 TGGrC^TTCCTTC^TTTGGAAAATGMGTGMTCCrGAGGTCTGGATGMTACTGTAAG 
TCATGGAAMCTGTGMGMCATCAMTA^ 

AGGTCCTGTTGTMCAGAAMTCTCTGATAAMCAGATAAMTGTAGATG^ I I I I IAACC 
TCTGCMGAGTCAAGCTAGTTAGATCTTTGTCTGAAAMOV^ 
MCCAAATTGTGCTATTGTGCTATCTATCTATCTATCTATCr 
[C,G] 

TATCTATCTATCrATTTATCTATCTATCTATAGATAGMCCTCCTL 1 1 I I GAATTTATGT 

TTTAAGAATATCAAGCTA I I IGI I GATATACATGA7TGCCTTCTATTGATCTATAGTTCT 

ATTACTTTTAMGCAAGAGGGGTCTCAAAAGACMTTGAC^ 

GAMGAATGGGTGAATGCTAAATTTTCC^ AGA 
TAI I I I I lAAMTTOACrrATTTTGTATTMGA 

3752 TTCCTTGATTTGGAAAATGMGTGAATCCTGA^ 

AMCTGTGMGMCATCAMTAMGCAGGACTMTGGAGTATGAGGTTACGAMQ 

GTTGTMCAGAAMTCTCTGATAAAACAGATAAAATGTAGATGG I I 1 1 I AACCTCTGCAA 

GAGTCMGCTAGTITAGATCrTTGTCTGAAAMCAAATA 

TTGTGCrATTGTGCTATCTATCTATCTATCTA^ 

[T,-] 

CTATCTATTTATCTATCTATCT^^ 

ATATCAAGCTAI I IGI I GATATACATGATTGCCTTCTATTGATCTATAGTTCTATTACTT 
TTAMGCMGAGGGGTCTCAAMGACMTTG^ 

TGGGTCMTGCTAMTTTTCCCCCAACCCCCCAAMTATTAGCCAATAGTAGATAI I I I I 
TAAAATTCTACTTATTTTGTATTMGACTTTAT^ 

3762 TGGAAMTGMGTGMTCCTGAGGTGTGGATO 

GMCATCAMTAMGCAGGACTMTGGAGTATGAGGTTACGAAAGGT 
AAAATCTCTGATAAAACAGATAAAATGTAGATGG I I I I I AACCTCTGCAAGAGTCAAGCT 
AGTTAGATCTTTGTCTGAAAMG\MTACTGTCCGGT AATGAAAACCAAATTGTGCTATT 
GTGCTATCTATCTATCTATCrATCrATCTATCTATCTATCTATCTATCTAT 

c-.c.n 
ATCTATCTATCTATAGATAGMCCTCCTCITTTGMTTTATC 

ATTTGTTGATATACATGATTGCCTT^ 

AGGGGTCTCAAMGACMTTGACTTGATMTATAGCr^ 

CTAMTTTTCCCCCMCCCCCCAAAATATTAGCCAATACTAGATA 1 1 1 I I IAAAATTCTA 
CTTATTrTGTATTMGACXITA^ 

AAAGCAGGACTMTGGAGTATGAGGTTACGAAAGGTCC^^ 

TAAAACAGATAAAATGTAGATGG I I I I I MCCTCTGCAAGAGTCAAGCTAGTTAGATCTT 
TGTCTGAAAMCAMTACTGTCCGGT^^ 
CTATCTATCTATCTATCTATCTATCTATCTATCTATCTATCTATCT 
CTATAGATAGMCCrCCTCTITTGMTTTATGriTTMGMTAT(^ I I IG I I GAT 
[A,G] 

TACATGATTGCCTTCTATTGATCTATAGTTCTATO 
MGACMTTGACTTGATMTATAGCTTTGTCAGAM 

CCCAACCCCCCAAAATATTAGCCAATAGTAGATA 1 1 I I I I AAAATTCTACTTATTTTGTA 
TTMGACTTTATTTATTMTTTTACAGTTACCTGGTGCT 

CTAATAAGCACACAACAGATG<j I I I(jI I I I GATTCL I I I I I ATATCCTTTGGAGAAGTTC 

Gl I I IGATTCCI I I I I ATATCCTTTGGAGMGTTCCACTAACGACTGTA 1 1 I I IACTGGG 

CAGAGTGAAATCATCATCTACAATGGCT ACCCCAGT GAAGAGTATGAAGTCACCACTGAA 

GATGGGTATATACTCCTTGTG\ACAGMTTCCTTATGG 

GGTACMGATATGTCTCTCCTGAAAAGGGG^CTGCATTGACCrCCT 

ATTTAATGCTAGATATGCATCMCAGAGTTTATO 

[T,C] 

CTTTAAATAGTTATCAGGGAGGCT CACTCTTTGCCTGATMTTCTCTGAAGACAGACAGG 

MCCTAAAMTACAMCAGCMGACTGATCTTGCTMCrGC^ 

GGTGTAMCAGAAAGGCAGAGCCTGCATTTTGTt^^ 

AAATTGCTTTGTCCCAGGAAAATGGATCCrcrC^^ 

TATGAMTTGACTCTGGGGCACCCMGMGAACCTCTCCTGCTCCCA 

MTTGACTCTGGGGCACCCAAGMGMCCTCrCCTGC^ 

CCTCTGCAGGATAAAAMCMTCTAGTTAMT^ 

GACTGAAMCCTTMCATCCACATACACITTGATCTMGGGAG^ 

AMGAGTATGGTGTCMTMGGCrTGMTTCTAGMTGAGGAGCC^ 

GGGGMTGATACTCCTTAAMGGGAAMTTTMCTACAMTCCTCT 

[A,G] 

AGMTMCCAAAATATCTGCAATGGTTCW 

GTGTTCATTTTGCATU I I I I I CCCACCACACATATTMGGAGG^GCTGMGTCATGTTT 
GACATTCTCTCCCTL I I I I ATCTCCAGTTTCAGMTGAAAAATGAGAGTGAGATATGAGT 
AGTTTTACTAGTTAAMTATGAMCACCCAGTTAMT^ 
TMTTTTGTATMGTCTCATTTTAAGATMTACT^ 

GTTTTCCAGGACTGAAMCCTTMCATCCACATACAC^ AAGGGACAGACGGTT 

CATAGAATGAMGAGTATGGTGTCMTMGGOTGMTTCTAGM 

GCCATAGCAGGGGAATGATACTCCTTAAAAGGGAAAATTTAACTACAA^ 
AGAMTGATMGMTMCCAAMTATCTGCMTGGTT^ 

GCTGCTTACCGTGTTCATTTTGCATC I I I I 1 1 CCCACCACACATATTMGGAGCAGCTGA 
[A,G] 

GTCATGTTTGACATTCTCTCCCTCT^ 

GATATGAGTAG I I I I ACTAGTTAAMTATGAMG\CCG\GTTAMTTTGAAGGTCAGATA 
MCMCAMTMTTTTGTATMGTCTCATTTTMGATMTACT 
TCACTATTATCACTATTTATAAMTTTTGTAGAGCATCCTGGATL I I I I IGCTTACTTTT 
Gl I I I IAI I I I I I GCTAAATCTGGCAATCCCAGGCACATGTGTGAAGGAGCTGTGAAATA 

5280 AMTMTTTATTGGCAGCrGCTTACCGTGTTG^TTTTGCATL I I 1 1 I I CCCACCACACAT 

ATTAAGGAGCAGCTGMGTCATGTTTGACATTCTCTCCCTCTTTTA 
ATGAAAMTGAGAGTGAGATATGAGTAGTTTTACT 

MTTTGMGGTCAGATAMCAACAAATAATTTTGT ATAAGTCTCATTTTAAGATAATACT 
AAAAAGTCATTATTTATTCACTATTATCACTATTTATAAAATTTTCT 

[T,A] 

CI I I I IGCTTAU I I IGI I 1 1 IAI 1 1 I I I GCTAAATCTGGCAATCCCAGGCACATGTGTG 

MGGAGCTGTGAMTATAAMGGAGAAMCTTTTATGGGAAAGATTTGGC1TM 

ATMTTTTGGAAAGATTTAGMTTAMGATCATTCATTAGATGTMTGTTCT 

TATATCAGTTAMCTTCTCATCMCMTATGAGATGGGTACCACTMTAGTCACCATTTC 

ACAMTGATGAMTTMGGCACMCCGGTTATGTTMGAGGCCTAMGTCCACAAATAGC 

5790 TGAGATGGGTACCACTMTAGTCACCATTTCACAMTGATGAMT^ 

TATGTTMGAGGCCTAMGTCCACAMTAGCMGCTGACAGACCAGMTTTMGCCCAGG 
CATGCTGGCTCCAGAGCCTGTGCTCTTAGTCA^ 
CACCCTGGrTACTTTGGATCTCCCT 
GCAGAGGGACACTGAGCTGAGCATATTATTGTACj I I I I I AAATGCTCTCCACTGGACAGA 

[A,G] 

GATGGGGGATTTGMTAGAMTTTGGTGAGGMCTMTCAGTGTCCATTTACACT 
M° CCTCTTCCTCCCTGGMGAGCTATAGGAGTGAGTMGCATGATAMTTTC 

TAMCCACACCCAGGAMTTTGTATATACAMTACATAGAGCACAGTAGTTATCAGGACA 
GACTTTGACATAAAMGMCTGGGTTTGAGTCCCTGCTCTGGCC. I I C_ I I ATCTGGGTGGC 
CCTCTGGGAAAGTTACTTAACTACATAAA(j I I I IGI I I CCATATCTACAAAATGAGGTTT 

5901 MGCCCAGGCATGCTGGCTCCAGAGCCTGTGCTCTTAGTCATTAAATTATAGTGCCT^ 

TTGACCTTCCACCCTGGTTACTTTGGATCTCCCTGMTGCTCTCTCTCCCTCAGAAATAC 

TGGMGTTGGCAGAGGGACACTGAGCTGAGCATATTATTGTAG I I I 1 1 AAATGCTCTCCA 

CTGGACAGMGATGGGGGATTTGAATAGAMTTTGGTGAGGAACTAATCAGTGTC 

ACACTCACCTCCTCTTCCTCCCTGGMGAGCTATAGGACTTGAGTAAGOT 

[C,T] 

GTGTCTTTGTAMCCACACCCAGGAMTrrGTATATACAAATACATAGA 
ATCAGGACAGACTTTGACATAAAMGAACTGGGTTTGAGTCCCTGCrCTGGCL I Id I AT 
CTGGGTGGCCCTCTGGGAAAGTTACTTAACTACATAAAG I I I IGI I I CCATATCTACAAA 
ATGAGGTTTCTCAAAATAGCAGCTAGTTTATAGAG I IGI I GCAAGAATTTAGTAAGCTAA 
TACATATAMTACGTCMCATAGCACCAGGTAOWWVTATGTGCTC^ 



CAACATAGG\CCAGGTACAAAAATATCTGCTC^GAMCTGAAG 
CTCTATACTATTGACAAGGGAAAAGTGAAAACAGI mTGTTl I ACCATGTGTGTATGTG 
TGrGTGTCrGTGATGTTTCCGACATGCTCTATTTM 
TCTCTCTCTTTCTCITTCTCC^ 

AGTTGTGTATATGCAGCATGCCCrGTTTGCAGACAATGCCTACTGGC^ 
[C,T] 

AATGGAAGCCTTGGATTCCrTCTAGCAGATGCAGGTTATGATGTATGGATGG 

CGGGGAAACACTTGGTCAAGMGACACAAMCACTCr^ 

(KCTTTACCTAAATATTA(XTMGAA 

MTMCGTGGACGCTATTAATGATTATCTTTGACGCrrGAAGTG\TATAGCT 
TTTCTGrTMGATCTCAMGGAGQGTMCAGCMGMGCTCT 

TTCTCTCTCTCTCTTTCTCITT^ 
CGGCCAGTTCTGTATATGCAGGW 

TATGCCMTGGMGCCTTGGATTCCTTCTAG^GATGG^GGTTATGATGTATGGA 
MCAGrCGGGGAMCACTTGGTCMGMGACACAAMCACTCTCAG^ 
TTCTGGGCCTTrAGGTAMTATTAGCTMGAAMCrCMGGGGGAMT^ 
[T,A] 

AAAAAMTMCGTGGACGCTATTMTGATTATOTTGACGCrrGMCTCATATAGCT 
TGTAGrTTCrGlTMGATCTCAAAGGAGGGrMCAG(^GMGCTCTGA 1 1 I I ICACTGA 
TTCTCCC^CMGCAAAGTATGGCATTTC^ACAAGATCA I I I I I ACATCCAATTCTGTGAA 
TTCTATGC^TTAAMGTATGTCCAMGAGACAGCTCAGGAMTTA 
ACATTCATTCAGCCAATGTTrACTGAGTGGCT ACTGTATGCGCTGTTG"AGGCCCCGAAC 

AAGCCTTGGATTCCTT CTAGCAGATGCAGGTTATGATGTATGGATGGGAAACAGTCGGGG 
AMCACTTGGTC^GMGACACAAAACACT CrCAGAGACAGATGAGAAATTCTGGGCCTT 
TAGGTAMTATTAGCTAAGAAMCTCAAGGGGGAMTTGGAGGCAAT^ 
CGTGGACGCTATTMTGATTATCTTTGACGCITGMGTCATATAGCTCCT^ 
GTTMGAT(TCAMGGAGGGTAACAGCAAGAAGCTCTGA I I I I I CACTGATTCTCCCACA 
[A,G] 

gcamgtatggcatttcaacaagatca i i i i i acatccaattctgtgaattctatgcatt 

aamgtatgtccamgagacagctcaggamttat^ 

gccmtgtttactgagtggctactgtatgcgcrgritctaggccccgm 

gmcagacamctctgacctcacamgcttatgttcatt™ 

attgctccrggattgccaatcmctgtgtaaagatgatttggacc^gg^ 

tmtgattatctttgacgcttgmgtcatatagctccttgtag^ 
aaggagggtaacagcaagaagctctgai i i i i cactgattctcccacaagcaaagtatgg 
catttcaacaagatca i i i i i acatccmttctgtgmttctatgcattaaaagtatgtc 
camgagacagctcaggamttatcatgacc^ 
ctgagtggctactgtatgcgctgttct^^ 

[-,T,C] 

TCTGACCTCACAMGCTTATGTTCATTTTAGTG^ 

TTGCCAATCAACTGTGTAMGATGATTTGGACCAGGACCTTAT^ 

GATTGATTTAGAGAMCTGAGATCGCACATAGTACCATTTTCAGGAAMCT 
GAI I I I IAAAACCI IGI I AATGGGCAATGAAGAAGAATL I I 1 1 1 I GATATC I IGI I ICI I 
TTMTGGAAGAGTTTTCTGCTGTCACCAGAGGAC^ 

7017 GGAGGGTAACAGCAAGAAGCTCTGAI I I I I CACTGATTCTCCCACAAGCAAAGTATGGCA 

TTTCAACAAGATCAI I I I I AG\TCCMTTCTGTGMTTCTATGC^TTAAAAGTATGTCCA 
MGAGACAGCTCAGGAMTTATCATGACCMTCT 
GAGTGGCTACTCTATGCGCTGTTCT^ 
CTGACCTCACAMGCrTATGTTCATTT^ 
[T,G] 

GCCMTCMCTGTGTAMGATGATITGGACCAGGAC 

TTGATTTAGAGAAACTGAGATCGCACATAGT ACCATTTTCAGGAAAACTCCAATATTAGA 

i i i i iaaaacci igi i aatgggcaatgaagaagaatc i i i i 1 1 gatatc i igi i ici i i i 
aatggaagag i 1 1 ictgctgtcaccagaggacaggctgatgcctgcgatagaci i i ici i 
tcttc^ggcctmgctccctgttgg^ 

7151 gamttatcatgaccaatgtgcag^ttcattc^gcgaatgtttact 

tgcgcrgttctaggccccgmg\ttcamcagggmcagacamctct 

cttatgttcattttagtgatmttttacmgtg^ttgct 

gtamgatgatttggaccaggaccttattgatttag^ 

actgagatcgcacatagtaccattttcaggaamct i i 1 1 i aaaacctt 

[g,t] 

ttaatgggcaatgaagaagaatci i i i i i gatatc i igi i ici i i i aatggaagagtttt 

ctgctgtcaccagaggacaggctgatgcctgcgatagaci i i ici i ici i caggcctaag 

ctccctgttggtttgtamcctgatgctagmcagactgtgtattcct 
aaacattcagtacccactgamgtttgagmtagtggag 
tcrgagttcttgggcagkkkkaagcatcagg 

7308 ctcctggattgcovatcmctgtgtam 

gamctgtgattgatttagagamctgagatcgc^ 
caatattaga i 1 1 1 iaaaacci igi i aatgggcaatgaagaagaatc i i i i i i gatatct 
tgi i ici i 1 1 aatggaagag i i i i ctgctgtcaccagaggact^ggctgatgcctgcgata 
gac i i i i c 1 1 i c i i caggcctmgctccctgttggtttct 

[C,G] 

TGTGTATTCCTATTACATTMTAAMCATTCAGTACCC^ 

AGGAATAGAATAGAATGTTATAGTCTGAG I ICI I GGGCAGGGGCMGCATCAGGAAATAT 
TGMTCATTAGTCrTTAGGAGGTCTCACAAC 

ATAGATTTCCTCACATG I ICI I I I AATAMC7\GGCTTCTAGCTTATGGMTACCTGATTT 
GACTAAATGTTATATAGGCCC I I I IGI I CCTCCTGTCTGMGAACAAAATACTAGTACTA 

7321 MTCMCTGTGTAAAGATGATTTGGACCAGGACCTTATTGATT^ 
ATTTAGAGAMCTGAGATCGCACATAGTACCAT^ 

TTAAAACCI IGI I AATGGGCAATGAAGAAGAATC I I I I I I GATATC I IGI I ICI I I IAAT 
GGMGAGTTTTCTGCTGTCACCAGAGGACAGGCTGATGCCT I I I ICI I ICT 

TCAGGCCTAACCTCCCTGTTGGTTTGTAMCCTGATGCT AGAACAGACTGTGTATTCCTA 
[T,C] 
TACATTMTAAMCATTCAGTACCCACTGAAAGTTTGAGMTAGTGGA 

AATGTTATAGTCTGAG I I CI I GGGCAGGGGCAAGG^TCAGGAMTATTGAATCATTAGTC 

TTTAGGAGGTGTCACAACAATTCTCCrATTCTTGTMGTCCC^ 

CATC I ICI I I I MTAAACAGGCrrCTAGCTTATGGMTACCTGATTTGACTAAATGTTAT 

ATAGGCCC I I I IGI I CCTCCTGTCTGMGMCAAMTACTAGTACTATGGMTATTGGTA 

GCGATAGAL I I I ICI I ICI I CAGGCCrMGCTCCCTGTTGGrrTTGTAAACCTGATGCTAG 
MCAGACTCTGTATTCCTATTACATTMTAAAACATTCAGT^ 

ATAGTGGAGGAATAGAATAGAATGTTATAGTCTGAC I ICI I GGGCAGGGGCAAGCATCAG 
GAAATATTGMTCATTAGTCTTTAGGAGGTGTCACMG 

CCAATCTATAGATTTCCTCACATG I ICI I I I AATAMCAGGCTTCTAGCTTATGGAATAC 
[C,T] 

TGATTTGACTAAATGTTATATAGGCCC 1 1 I IGI I CCTCCTGTCTGAAGAACAAAATACTA 
GTACTATGGAATATTGX^ATATATTAMTATATATCTATATATCCATGTGX^(^GXW\TA 
CTACTACTMCAACATCTTACTGAGCACCCACTGGC^ I I ICI I ICATACT 

ATTAMCCCCGTTAGCAGCCCCGTAMCCAGGrACTACCCTGrrTAT^ 
AMCATAGGXTC^G7\GC^TTTCAGTMTTTCrO 

ATAAMCTGGTCAGGAGAAATTGTATTTCATTGGA 

TGTTTATGAGGGTCACTGTTAGGTGTG I I I I I GAGX^CAGTTTTCTCAGAGTCTTACAG 

GTVTTTCACCTTTATGTTGXWVTAA^ 

CTCTGXZTGTjGTVVTAACCCTACTACTCTM 

TGTAGQGCAAACCTTTCCTGGGTCTCTQGTCACAGCAGCA 

[T,C] 

TTCCCAGGMTMCATCTGTTCC^^ 

ATTCCCTCrGAGCTGAAAAAGTAAMTTCAATGCCATGGAAT^^ 
ATGTGG\TCMTCATCrcrTTCT(^CMCCOWkTGGGA I 1 1 I lAAAAAATAAAAGGGAA 
GGGrTTATACCTATATTTAMCAMTTGTW^CXKATGGTTATA I I ICI I IGTGAGTTGG 
MCACACMGCTTACTATMTAMTCMT^ 

TMGTAGCTGTGAGCCTGCAGrGCACAGACTATATGrAGGG^ 

TGGTCAC^QCAGCATATTGACTACGGrGATGCMTTTC 

ATTCAMGAMTMTTCCACAGAGTMGTITCTAGATTCCCTCTG^ 

ATTCMTGXICATGGAATATGXCTGAAACATAATAAATGTGXATCMTCATCT 

CAACCCAAATGGGA I I I I I AAAAMTAAMGX^GMGGGXTTATACCTATATTTAMCAAA 

[C,T] 

TGAAAAGGCATGGTTATA I I ICI I I GTGAGTTGGMCACACMGCrrACTATAATAAATC 
MTTGAGCTTATCTATTCACTCT^ AAG 
CACTATGTAGAAATTTCTAAAG I 1 1 I I I AAGCTGACAACTTAC I ICI IAATTTACTTACT 
TTACTTMTTTACTTTACMTTTACTTTCCAGGTA^ 
TTCCMGTAAMGTTGAAAGGMCCCA(^CTAATAAAAGCTTTGM 

AMTGTGCATCAATCATCraTTCT<^<^CCCAMTGGGAI I I I I AAAAAATAAAAGGG 
MGQXTTATACCTATATTTAMCAMTTG7WV\GG(^TO^ATAI I IGI I IGTGAGTT 
GGMCACACAAGCTTACTATMTAMTCAATTGAGCTTAT^ 
TATTTATGAMTAGCA^GTAAATGTMGCACrATGTAGAMTTTCTAAAG I 1 1 1 1 IAAGC 

TGACAAGTACrrGTMTTTACTTACTTTACTTM 

[G,A] 

TATTTTGGAAAGAMTCMTMTOAGTTCCMGTAAAAGTTGAMGG 
TAAMGCTTTGMTTTCTCATTGMCT^ 

CATGTGAAAGTGCMTATTTCAGTTTAGGGAM ATCATCAG 

TMCAMCATATATTCATTAGTATTTTAGATTGACA^ 

CAGTTAGCATCAGTCAGCATATACTAAAAMGTATCAMGA^ 



9967 
10008 

GTTTCATTTAGGACATAAATA 1 1 I I I AGTGACTG I IGI I IGCATTTTGGACAGAGCAATT 

TCrGTTATGTMGGAGCACCCACTCTTTGTAGGACATTTAGTAGGTCCCAG^ 

CAGGGCTCTGCAGTCAGCGTGACCCTCAA 

TCTGQGGAAGTACTATTCCTGATTCAGAGTL I I I I I ATCMTTGTTCAGTCAATTATTTC 
AG I I L 1 1 L 1 1 1 1 I CTGGCCAAGACAG I 1 1 I MTGTTCCAACAAGTGTTTCAGTACACACA 
[T,C] 

ACACACACA(^CACACACACACA(^CA(^CACACACATG(TAGTGG^ 

ACCTCTGGAMCCAMTTATATGGATATTCTCCCTAGCCTACCCACTGTTGTGCT 

CCATCCTCACAGATATACAMGGGGTGCAATGCTACT^ 

GATGCCTGGTCCTTACTGGGCCATCGTGGATGCTAGGGAAAGCCCC I I I CI 1 1 I IGGAAA 
CAGGGAAGAGTCTAGAGGGTTGAAAMCACCCAGTMGA^CTGGGAGC^ 



CATTTTGGACAGAGO\ATTTCTGTTATGTAAGGAGG\CCCACT 
TAGGTCCCAGCCCATTAAACAGGGCTCTGCAGT CAGCGTGACCCTCAAAAATCTCACCrC 
G^CACATTTCCAMCACCCTCTGGGGMGTACTATTCCT I I I I IATCAA 

TTGTTCAGTCAATTATTTCACj I ILI I LI I I 1 1 CTGGCCMGACAGTTTTAATGTTCCAAC 
AACTGTTTCACTACACACATA<^0\CA^ 
[C,T] 

AGTGGAGGCCCAGGMGGGACCTCTGGAMCCAAATTATATGGATAT^ 
CCCAGTGTTGTGCTMTCTCCATCCTCACAGATATA 
AMGAGCAMGCAMTGGAGATGCCTGGTCCTTACTGGGCCATCGTGGATGCT 
GCCCLI I I LI I I I I GGAMCAGGGMGAGTCTAGAGQGTTGAAAMCACCCAGTAAGACA 
CTGGGAGCAGTGAMTTTCATTCCATAGTGAGAAAGAAMCCT 



10363 AGCCTACCCAGTGTTGTGCTMTCTCCA^ 

CTGCTGAMGAGO\MGCAMTGGAGATGCCTGGTCCT^ 

GGGAAAGCCCL I I I LI I I I I GGAAACAGGGAAGAGTCTAGAGGGTTGAAAAACACCCAGT 

MGACACTGGGAGCAGTGAMTTTCATTCCATAGTGA^ 

TGGGTGATGCTGCAGAAAGAMTCMTTCACCTCCTGTG^ 

[G,A] 

CTCTGTGATTCATTCTGGCATCTCAGAGTTAGGGATGAMTG^ 
ACCCCATGCrTGGGAAGTTTAC^C^GCAGT AGCTACTCCAGCAGCTTMCCATCACCTTT 
CCCCTfXCAACTACTCCAT^ 

AAMTTGGAGACTTGAGAGCAGAGMGACTGMGGCAGATTA^ 
GMC^CTTCCAATTCATCCCCACT 



FIG 3-22 



Docket No.: CL001186DIV 
Serial No.: (to be assigned) 
Inventors: Gennady V. MERKULOV et al. 
Title: ISOLATED HUMAN LIPASE PROTEINS, ... 

TCTCAGAGTTAGGGATGAAATGAGMTGTTGCCAGCA^ 
ACACAGCACTAGCTACTCCACXA^ 

TCCCCCMTCMGTCAAACTGTCCATAMTAGAATAAMTAA^ 
AGAGMGACTGAAGGG^GATTATCTTTATAGMTM 

CAGTATGATCACGATAGMGGAAAAAATGACTAAGCAGAGCCCCAAI I I IGI IAGAAAGA 
[T,C] 

TGCGTAAGTATTTAI I I I I AGAAGATTGTCTTATCTCCTGTTCTCTCAGGGTTTGTAGCC 
TTTTCCACCATOICTGMCTGGCACAAAGAA^ 

ATCTCATTCAAATATCCCACGGGCA I I I I IACCAGGI 1 1 I I I CTACTTCCAAATTCCATA 
ATCMGGTAGGCTCCTITGAACAAAATGTACCTGAGGATCTCATTTTGGATCATAMTC^ 
TTATTATTTTCAAATCTACTGTAMGTAAMGT^ 

TCCTTTCAACAAAATGrACCTGAGGATCTCATTTTGGATCATAAATC 

MTCTACTGTAMGTAAAAGTAGGAMTmGATAAMTCTATAGM 

GGTATGTGCTTGTGTATGTGTGTCCCTGCGTGTGCGCATGTCTGTGCCATAGTATCTGCA 

GGTTCTGTMTACAATTTACTATACMGGTCATCAGCAGGCT^ 

CTAGCTGAACTGAGTGCTATATGACAACAAGGAI UNCI IGI I I ICCCAAGTGI I I I I I 

[G,T] 

TTCCAT7TAGTCAGGTAGGTCMTGAATTCACATTGCCCAMTGAA 
ACCCATMTCACTGATGTGTCCMTTTTGACATTAGAAAMCCT 
CCMTATGGAMCTTGCCCTMTMCTAMGCrMGATTCCAMGCCTAAATGTATTACA 
GCTCAAGTATTAATTCAAATATTTATTGGTTAI I 1 1 I CAGGAGTTGAAAAAGTCATTTGG 
TTGCCAATTGTGGATTTGGGATTTTATCTATTAAAGGG 1 1 I 1 1 I I I 1 1 I I I ICTCTTTGC 

TTTAAGTCCCATATCCTGCTL I I I I CI I CCGTCAGTTTCCCCCAGAAGCTCCAAGACCCC 
ACCAGGMTCCCCATCCAAGTTTACTTTCCC^^ 

TTGTGACATTATCATATC I I I I CTGTTGA^TGGTTGCTTCTCrTTGGCTCACTGTTCTCT 

ACTTTTCAGCCTGAGAGCTGGCTMTCTGGGACAGTACTCGA^ 

TMCATGGAAMCCCCGATTTTCCCTTATATTCMGGTATTATT^ 

[T,C] 

GIN lACATTTCATACG^ATTMTGAGAAAAAMTATTGGCMGCACTGACTGGGCAGM 
TACAGGGMGCTTCACTATGGAGMGTGMTTTGGGATTG^ 

CTTGTAMTMTATTTGATACTCrrCCrCATCTGGAGAGAG^TTCCTAAGTAAL I I I I CC 
TGAATMTTTGGTCTCCTTGACTGMTCACT 

TTTCCTAGMTGAMGAMTGTCAAGMGTCTGAAGATGATTCrTGAATTTT I I I I I 

AGTCCCATATCCTGCTC I I I IGI ICCGTCAGTTTCCCCCAGAAGCTCCAAGACCCCACCA 

GGAATCCCCATGCAAGTTTACrTTCCCMCTCCTGGMGTTTCMTTGTGCrGCCr^ 

GAGATTATCATATL I I I I CTGTTCMTGGTTGCTTCTCTTTGGCTCACTGTTCTCTACTT 

TTCAGCCTGAGAGCTGGCrAATCrGGGACAGTACTGGAATGCAGTGTACACATGGGTAAC 

ATGGAAMCCCCGATTTTCCCTTATATTCAAGGTATTATTTGACCTTMGAAAAACTG^ 

[C,T] 

TACATTTCATACCAATTAATGAGAAAAAMTATTGGCAAGCACTGACTGGG 
GGGAAGCirCAGTATGGAGMGTGMTTTGGGATTGAGGGCCTTTAT^ 
TAAATMTATTTGATACTCTTCGrCATGrGGAGACAGATTGGTAAGTAAG I I I I GGTGAA 
TMTTTGGTCrCCrrGACTGMT^ 

CTAGMTGAMGAMTGTCMGMGrCTGMGATGATTCTTGMTTTTGG 1 1 1 1 I IGCTA 

TAGMGATMGAAMCGMGATAGCTTCTACCAA^ 

TGATATGTAGCGAATTTATGTCCTTATO 

TATGTATGATMTTATAGGGCCAT^ 

ATTTTGATATATCTATTTACrGTATAMTTCATATGGrATTCCAMCCCT^ 

TTTTTTTTTGCITTTAAAAATGTTTATGGGTATATAATAGT^ 

[C,T] 

ATATATTTTGATATMGC^TAG^TCTGTMTGACOWVT 

TCACCTCAAGCATTTATCAI 1 1 C_ I I I I IGI I AGAGAC^TTCTAATTTGACTCTTCTAGTT 
ATTTTGAMTATA^TGMTTATTGTrTMCTATACT 
TTAGTCCTTCTMCGGTATTTTGGTAC^^ 
CCCTACTACCTITCCG^GCCTCTGCTM 

Al I I I I I I I IGLI I I I AAAAATGTTTATGGGTATATAATAGTTGTACATATTTATGAGAC 
ACttTATATTlTGATATMGCATAC^^ 

CATCACCTCAAGCATTTATCAI I I LI I I I IGI I AGAGACATTCTAATTTGACTCTTCTAG 

TTATTTTGAAATATACMTGMTTATTGTTMCTATAGTCA^ 

CTTTAGTCCTTCTMCGGTATTTTGGTACCCA 

[T,A] 

CCCCTACTACCTTTCCCAGCCT 

AG I I I I I I I I I AAACTCCCCTATATGAGTGAGAACATGCAGTATTTGTL I I I I IGTGCCT 
GGCTTATTTCACTTMTGTMTGTTCTCTMTT^ 
TTTCATTLI I CI I ATGGCTGTCTATATGTACCACATTTW 
TGGACACTTAGGCTGATTTCATATCTT^ 

MTGTTTATGGGTATATMTAGTTGTACATATTTATGAGAC^ 

CATACAATGTGTAATGACCAAATCAGGGTMTTGGGATATCCATCACCT 

CATTTGTTTTGTTAGAGACATTCTMTTTGACT 

GMTTATTGTTMCTATAGTCATCCTATTGTGCATGCCAGACriTACT 

ATTTTGGTACCC^TTAACCAATGCCTCTTTATCCTTCCCCCA 

[C,G] 

CCTCTGGTAACG\TCATTCrrCTCACTATCTCTATAAGGTCAG I I I I I I I I IAAACTCCC 
CTATATGAGTGAGAACATGCAGTATTTGTC I I I I I GTGCCTGGOTATTTCACTTAATGT 
MTGTTCTCTMTTTCATCCACAT^ I I LI IATGGCT 

GTCTATATGTACCACATTTTATTTATCCACTCATCTGTTGATG^ 
CATATCTTGGTCATTGTGAATAGTGCTGTACT AMCATGGGGGTGC7\GATGTCTCTTCCA 

AGAGATAGAGATCTMTTTCATTCTTC 

TCTTGTGGAMTTGTCCTTTGCCCAATGTATGI ILI I GATGCL I I IGI IGAAAATTAGTT 
GACTATAAATGTGTGGATTTATTTGTGGG I ILI I I ATTCTGTTCCATTGGTCTATGTGTC 
TGI I I I I ATGCCAGTATCATGCAG I I I I GATTATTACAGGTTTGTAGTATAATTTGAAGT 
CAGGTCATGTGATGCCTCCAGL I I IGI ILI I ! 1 1 I CTCAGAATCTTATATTTAGAAAAAC 
[C,G] 
TAMGACTCCMCAAAAMCCTGCTA 
ACAACATCAACATACAAMTTCAGCAGGAT^ 
AAAAAGAAAGAAAAAAAAACAAGAMTMTCCCATTW 
ACACCTAGGMTAAACCATACCAAAGAAGT^^ 

ACTGATGAMGAMTTGAAMTGACATTAAAAMTGGAMGGTATTCCATGT^ 

Al I ICI I GTGGAAATTGrCCTTTGCCCAATCTATG I ICI IGATGCCI I IGI IGAAAATTA 
GTTGACrATAAATGTGTGGATTTATTTGTGG(j I I C_ I I I ATTCTGTTCCATTGGTCrATGT 
GTCTGI 1 1 I I ATGCCAGTATCATGCAG I I I I GATTATTACAGGTTTGTAGT ATAATTTGA 
AGTCAGGTCATGTGATGCCTCCAGC I I IGI ICI 1 1 I I I CTCAGAATCTT ATATTTAGAAA 
MCGTAMGACTCCAAOWW^CCrGCTAGMCTGATAMCAAAT^ 
[G,A] 

GATACMCATCMCATACAAMTTCAGCAGCA 

AAAAAAAAGAMGAAAAAAAMCMGAMTMTCCCATTTATMTAGCTAGW 
AAMCACCTAGGMTAMCCATACCAAAGAAGTGAMGATTTCrACM 
MCACTGATGAAAGAMTTGAAMTGACATTAAAAMTGGAMGGTAT^ 
GATTGCMGAATCAATATTGTTAAMTGTCCATATGATCCAAMCA^ 

ATTGTCCTTTGCCCAATGTATG I ICI IGATGCCI I IGI I GAAAATTAGTTGACTATAAAT 
GTGTGGATTTATTTGTGG G I IC I I I ATTCTGTTCCATTGGTCTATGTGTCTG I I I I IA TG 
CCAGTATCATGCAGI I I I GATTATTACAGGTTTGTAGTATMTTTGMGTCAGGTCATGT 
GATGCCTCCAGC I I IGI ICI I I I I I CTCAGMTCTTATATTTAGAAAAACGTAAAGACTC 
CAACAAAAMCCTGCTAGMCTGATAMCAMTTC^TTAM 
[A,G] 

CATACAAMTTCAGC^GCATTTCMTATGCCMGAGCAM 

AAAAAAAMCAAGAMTMTCCCATTTATMTAGCTACAMTAAM 

ATAMCCATACOW^GAAGTGAMGATTTCTACMTGAAM 

GAMTTGAAMTGACATTAAAAMTGGAMGGTATTCCATG1TCAT 

MTATTGTTAAMTGTCCATATGATC^ 

TGTGGATTTATTTGTGGG I ICI I IATTCTGTTCCATTGGTCTATGTGTCTGI I I I IATGC 

CAGTATCATGCAG I I I I GATTATTACAGGTTTGTAGTATMTTTGMGTCAGGTGATGTG 

ATGCCTCCAGCI I IGI ICI I I I I ICTCAGAATCTTATATTTAGAAAAACGTAAAGACTCC 

MCAAAAMCCTGCTAGMCTGATAMCAAA7TC 

(ZATACAAMTT(ZAGX!AGCATTTCMTATGCCMGAG 

[-,A] 

AAAAAAMCAAGAMTAATCCCATTTATMTAGCTACAMTAAAA 

TAMCCATACCAAAGAAGTGAMGATTTCTACAATGAAAACTATA 

AMTTGAAMTGACATTAAAAMTGGAMGGTATTCCATGTTOT 

ATATTGTTAAMTGTCCATATGATCCAAMCMTCTACAG^ 

AMTACCA^TGACATTCTTG\TTGAMTAAAAAAAMGCCTAAM 

MTAATCTTAAAAAAMCAAAGAAAAAAAMGAAGAA^ 

AMTAAMTAAMCACCTAGGMTAMGIATACCAAA 

AAACTATAAAACACTGATGAAAGAMTTGAAMTGACATTAAAAAATGGAM 
ATGrr(^TGGATTGCMGMT(^TATTGnTAAMTGTCCATATGATCOWVkO^TCTA 

CAGATTCMTGCAATCCCTATCAAMTA^ 

[-,A,G] 

CCTAAMTTTMGTGGMCCATGMGGTAGATGTCrGCrATACA^ 
CMOWKOTGMTATGMGACTGGG^ 

TGGTGAMTTTAGGAGAATGGATGTTTTATAATGGGrAGCAGI NCI I ACATGTTCTCAA 

TCAGCCATMCTTACTACAGTCMTTTGAATTTATTGCAT^ 

ATAAMTCCTAAAAMGGAGAGAAGCACATATAMCCTGCGTCTTATTTCATCT 

TAGATGTCTGCTATACATAGMGATTAAGrACrCAA 
GMGTGMTAGGCAGCTTCACTCTTC^^ 

TATAATGGGTAGCAG I I PCI I ACATGTTCTCAATCAGCCATAACTTACTACAGrCMTTT 
GMTTTATTGCATTTGAATATATTGGATTAAAM^ 

CATATAAACCTGCGTCTT ATTTCATGTGTTCL 1 1 I LI I I GTGGGTGAC I I I IGI I I IGAA 
[A,G] 

TAAMCCTGCAAAATAACAGGACAGGGTGGAAGGGAGATGGGATCCCCTC^^ 
AGCAGCAGTCCTG I I I I ATCACCTCTTG\TTTTCTGTTATTGAGAATTCAAGAAGAAG^ 
GGAGGAAGAGTTCACATCCACAGACTGCTCTGGFrGAATAGTT 
MTAGCAGCCMTGAGGCTGTTACAGTGAAGCC^GTCCCMGATMTTG^ 
TATTCrcrMGMGOAMTTGTGTTAGACTGAAACC^TM 

TGCAAMTMCAGGACAGGGTGGAAGGGAGATGGGATCCCCT(XIT^^ 
CTCCTGTTTTATCACCTCrrCAT^ 
GAGirCACATCCACAGACTGGTGTGGlTGAATAGrrGrCTCT 
GCCMTGAGGCTCTTACACTC^ 

TMGMGCTAMTTGTGTTAGACTGAMCCCATMGGAACCATTG 
[T,C] 

TCAAAAGTAAAGA I I I I I MTAGTTTCTOTAATTAGATTATTTTCTAAGACATAGAATT 
ATGATTACTATTTTATCrCTATMTTTTC^^ 

CCTTTGGAAAAAATTGGC I I I I AGCTTTAC 1 1 I I GCAATATTTTATTTTATCCCGXTAAA 
AGCCTAGGAAATTGGTACTATGAC I I I I AGTATGTTCATTTAATAGATGAAAACACAGAA 
ACTCAMGATGTTAMTATGGTGGCCMGTTCACAMGCrG^ 

GGTGTGGTTGAATAGTTGTCTCT ACTGTATTCCAAATAGCAGCCMTGAGGCTGTTACAG 
TGMGCCAGTCCCAAGATMTTGTTCTGTACCCCTATTCT 

AGACrGAMCCCATMGGMCCATTGTTCAAAGTTGGLI IGI I CAAAAGTAAAGA I I I I I 

MTAGTTTCTCTTMTTAGATTATTTTCT 

CTATMTTTTCATCTCTATMCGTTTACAMTACT 

[T,C] 

TTTAGCTTTAL I I I I GCAATATTTTATTTTATCCCCATAAMGCCTAGGAAATTGGTACT 
ATGACTTTTAGTATGTTCATTTMTAGATGAAMCAC^ AAATAT 
GGTGGCCMGTTCACAMGCrGATCATTMCAACM 
GATTTMTCTGTGACAGTGCACCTGGGTGCGCATGCATGCA^ 
TAGAACCTTTCCTAGTTGGCTrrGCTCCATGATGACCA^ 
CTCAMC^TGTTAMTATGCTG^ 

CTGAACTCCTGGTITTCTGATTTMTCTGTGACAGTG^CCrGGGT 

CACCCCC^CACTTGCACATAGMCCTTTCCrAGlTGGClTTGCT 

TGTTCCITCTACTTCAAMTMGCAMTTATCCrACAGATT^ 

CTGrCMGCAGCCCATTCCATTAGrCAGGTGTGGTTCACT 

[A,T] 

AATGGTATATTTATCTAGATAATTCTACL I I C I I ATTTTCAAAGCCCCAGTL I I C I I I G C 
TAATTCTGTGCATCA 1 1 I I I CTCTGATTCTGAAAGGCAAAA I ITTG1 IGGGCAATTGCTG 
TAATATGAG 1 1 I I ATCrCCTTTAGAGTCGMTGGATGTGTATATGTCACATGCTCCC^CT 
GGTTCATCAGTACACMCATTCrGCATATAAMCAGOT 
ATTCCMTCCTTATTTTCMTATATTTAAA 

ACMCAGGGCCTGMCTCCTGGTITTCTGATTTMTCT 

ATGCATGCATCACCCCCACACTTGCACATAGMCCTTTCCrAt^^ 

TGACCATTACTGTTCCTTCTACTTCAAMTMGCAMTTATCCT^ 

TACAGGTGTGCTGTG\AGCAGCCCATTCG\TTAGTCAGCTTGTGGTTCACT 

GTATTGACCTAAATGGTATATTTATCT AGATAATTCTACL I Id I ATTTTCAAAGCCCCA 

[G,A] 

TCI IGI I IGCTAATTCTGTGCATCAI I I I I CTCTGATTCTGAAAGGCAAAA I I I PG1 IGG 

GCAATTGCTGTMTATGAGTTTTATCTCCTTTAGAGTCGMTGC^ 

TGCTCCCACTGGTTCATCAGTACACMCATTCTGC^TATAAM 

ATC«AAAACCATTCCAATCCTTATTTTCAATATATT^ 

MCAGGCCTACCCTMGMTCTTMC^GCrrGCTT 

TMGAC<TTCCTTCCAGTTTCTCCTTGCTCtCCTTCT 
TMCAGAMC<^TGTTATC<rrA(IAGACCAAGTACA^ 

TCAGAGI I I IAAAMTAGGCCCTGAACTGMG(^GAGCTAAACTAGGGAAGCCT(^G(^ 
GMCTGAGACTT(TCCAGAGAGMGTATCTGGGATTTAAL I I CI I I CTAATGAGGCTTGG 
TTTTCCATGAACTTTTCCTTTAAACCAAGGGGGGTAT^ 
[A,G] 

TTTCTCATMTTCTAAMTGGGTGGTTACATCCTTCTGCTGAT 
GTCCTAGCATACAGCA I I I I I CTAAMTTTGCTGTTAGCTTTCATGATTCTTACCCTAAC 
TATTC I I I I I CTAAAAAACA I I I G 1 1 1 CAGCTITACCACTCTGATGAATTCAGAGCTTAT 
C^CTGGGGAMTGACCCTWMTATGAMC^^ 
CCCCAGCATCCTGATTTTC^TAMTTATAATAAAAMTTATTTGA 

AGTAGATCV\CATAAATGAACACCACCTTAAATCAG7\GI I I i AAAAATAGGCCCTGAACTG 

MC^AAGAGGTAMCTAGGGMGCCTCAC^GMCTC^GACTTCT 

TGGGATTTAAC I I CI. I I CTAATGAGGCTTGG I I I I CCATGAAL I I I I CCTTTAAACCAAG 

GGGGGTATTGCTCATCTTTCTGTTGAGTCCCATTTGT CATMTTGTAAAATGGGTGGTTA 

CATCCXTCTGGTGATCTAGGAC*:^^ I 1 1 ICTAAAAT 

[T,G] 

TGXTGTTAGCmC^TGATTCTTACCCTAACTATTC I I I I I CTAAAAAACA I I IGI I ICA 

GCTTTACCACTCTGATGMTTC^G^

ACATTACMTCACCTCV\C<TATTTACACT^
ATAAAAMTTATTTGAGGGT(^^

TAGAALI I I I I I 1 1 AAAAAMTTTTMTTTTMTTTTMTTTATTTCAG 
GGGGTATTGCTCATCTTTCTGTTGAGCCCCATTTCT 

ATCCITCT(XrrGATCTAGGAGCCCTATTTTCGTCCTAGCATA(^G<^ I I I I ICTAAAATT 

TGCTGTT AGCTTTCATGATTCrTACCCTMCTATTL I I I I ICTAAAAAACAI I IGI I ICA 

GCTITACG^CTCrGATGAATTCAGAGCTTATGACrGGGGAMTGACGCr^ 

ACATTACAATCAGGTGAGCTATTTACAGTMCCCCAGCATGCTGAT^ 

[A,G] 

TAAAAMTTATTTGAGGGTGGAAAGAa"CCTACCTGTCATTTGCT 
AGAALI I I I I I I I AAAAAMTTTTMTTTTMTTTTAATTTAT^ 
TTAAAGMGCATATACAMGAMCTTACATCATGTGTMTCOT 
AGATGTACrMCATTTTGGTGTATTTATTCCMTTTrCTCAGTA i I I IAGA 

CAACTTTTMTaTTCTATTTTA(XrM 

ATCTAGGAGCCCTATTTTCGTCCTAGCATACAGCA I I I I I CTAAAATTTGCTGTTAGCTT 
TCATGATTCTTACCCTAACTATTL 1 1 I I ICTAAAAAACAI I IGI I I CAGOTTACCACTC 
TGATGMTTCAGAGCTTATGACTGGGGA^ 
GGTGAGCTATTTACAGTMCCC^GCATGCTGATTTTGATAAATTA^ 
TTGAGGXTTGGAMGACrCCTACCTCT I I I I I I 

[T,C] 

TAAAAAMTTTTAATTTTAATTTTMTTTAT^ 

ATACAAAGAMCTTAC^TCATGTGTMTCCTTCGVTCCAGAGATA^ 

ATTTTGGTGTATTTATTCCMTTTTCT 

TTTCTATTTTACrTAAGCTATAGTMGAGATMCTAATATAACTGAGG^ 1 1 I I IAAATG 
CATTTTTMTGGCTACATAATAGAAA^ 

AAMTGAM(IAAMTCMCACGCACATT^ 

GAGAGTGTTMTGTCCTTAGMTTTGGCCACAGTTAGCTGGTCCTACT 
GGTCCTATTTTGTGAATTMTCTCATTTGATGCCAA I I I I I ATTACATTCTCTCCAAAAA 
ACTAGTCTCMCAGTITTGCrCTCTCCTCMGTTCACA 
TTTTATTGAGTATMGTVCWm^ 
[A,G] 

I I I IGI ICACCAGTGI 1 1 I CTCATCn"GMGAGTACATGACAATTACTGGGCTCCCAGTA 
TCTATGTGTTGCATTAATGAAA I 1 1 LI IMGTTMTCTACCTCAAAATGTCTCTATCTT 
CTTGATTCTCTCCTTCCTTTCTCTATCAGAAM 
TCCGGTCCrGTGCCCTTGATCCCATCTCrr(TG\CrrCCCC^ 
TCCTGTCCCTTATGAAAM(^\AGC^ 

TGAAGATCATTATGCTOV^GTACTAAACT 

CCACAGTT AGCTGGTCCT ACTCTGCTCCMGCCGGTCC^^ 

TGATGCCAA I I I 1 1 ATTACATTCTCTCCAAAAAACrAGTCTCAACAGTT^ 

CMGTTCACAGO\TTATCTCTGCrATATCrATATTTTA^ 

ATGTMGXTCCATGAGGGTAGGGATTTCTCATCG I I I IGI I CACCAGTGTTTTCTCATCT 
[T,G]
C^GACTACATGACMTTACTGGGCTCCCACTA

TMCTTTMTCTACCrCAAAATGTCTCTATL I ICI I GATTCTCTCCTTCCTTTCTCTATC 

AGAAMTGATGCTCCTCTTATTTTCCAACTTATC 

CTTCTCACTTCCCCTTCCTTCCTGCCTC^^^ 

ACCATCAATTCTATCAAGTTATCATTATCTCACrCTGI ICI I ATCAACATA I I I I I ACTA 

18984 CAGTATGTATGTGTTGCATTAATGAAA I I I LI I MaTTMTCTACCTCAAAATCTCTCT 
ATCTTCTTGATTCTCTCCTTCCTTTCTCTA 
GTTATTCCGCTCCTCTGCCCXTGATCCCAT^ 
CATTCTCCTCTCCCTTATGAAAMCAAC^^ 

CTCACTCTG I ICI I ATCAACATA 1 1 1 1 IACTATTGAAGAGGGU ICI I CTACTTACTCCT 
[G,T] 

MCCTTCTACAATCTAGTTTAGCTCTTCATL I I I I I ATCATAGCTACCTTATTTAAAGTC 
ACCCATGGCTTTTAATTGCCAAATTCA^TGGCCTATCTTCACC I I I I GAAATCTCTTATG 
TTCGTTACCAOXCTCTCCTTGAMCT^
TTTCTGATTTTCCTTCTGTTTCTGATTGTTCCTTT^

TTCCACCTCTCTGAMTCATTAGCATT^^ I I ICI ICCT

19407 CGTTACCACACTCTCCTTGAAACTCACTCCCCTGACTTGGACr^
TCT GATTTTCCTTCTGTTTCTGATTCTTCCTTTTCTCCCA

CCACCTCTCrGAAATCATTAGCATTCCCCMC^TTCTTCAAAA I ICI ICCTTG

GAGAACTCAGCATAGCTTTMTTTGGACCATTTCTATGGCT^ I I I I I ICAGGA

CTTGCCTTCAACCTATTCTTTCTCTAGCTG^



[C,T]

GMGACAGACCTCCGAGAAATGACCCTTCTCTCCA^

CCTAGCCTGACATTCAGACTTTGATTATCTGCCTC

TTATATATTCTGTTCTCCAGCTACACTGGGAAGCrTGCC7\TTCCT

ACTCTTCCTGCCrCCCACTCACCCTCATCTCTGCT

CTCATTT^^GGACCCCTOTTCTATGAAGCCCTC^GCTGGAAATM I I I I I IGCCTTT 

19531 CTCTCTGAAATCATTAGCATTCCCCMGGATTCITCAAAACTCTC I I I C I I CCTTGGAGA 



ACTCAGCATAGCTTTMTTTC^CCATTTCTATGGCTrATCTAGA I I I I I ICAGGACTTG 

CCTTCMCCTATTCTTTCTCTAGCTGATTCCATTM 

GACAGACCTCCGAGAMTGACCCTTCTCTCCAAMCTTCCGCMTAT^ 

AGCCTGACATTCAGACTTTGATTATCTGCCTCC^ 

[T,C] 

ATATTCTGTTCTCGAGCTACACTGGGMGCTT^ 
TrCCTGCCTCCCACTCACCCTCATC 

TTTCAG^GGACCCCTCTTTCTATCW\GCCCTCAGCTGGAAATAA I I I I I IGCCI I I I I 1 1 
CCATTTTATTTTTGGACTGTTTATGGCAT^ 

GCCTTCCTCCCTCTTTTGCAAA I I ICI I AAAGCTAGAGACCATTCTATG I I I I C I I CATA 

19911 CTCATCTCTGCTCTCAAAATGCAACCTTCCCT 

CTATGAAGCCCTCAGGTGGAAATAA I 1 1 1 I IGCCI 1 1 I I I ICCATTTTAI I 1 1 IGGACTG 
TTTATGGCATTTMCATACCTTACTTTCTA^
AAAI I ICI I AAAGGTAGAGACCATTGTATG I I I ICI I CATATGTTGCTGGTGCCTAACAG

MCTATGGCCATTXnCCaCATTCAm 

[C,T] 

CTCTCATGMTGCCCTTGCTTTCT^ 
GCCATGAMCTGCCTACTGCTAT1TGGGCTG 

GATGTGGCCAGGATACTCCCTCAMTCMGAGTCrrCATTACTTTMGCT 
TGGMCC^CTTTGATTTTGTCTGGGGCCTCGATGCCCCrCAACGGATGr ACAGTGAAATC 
ATAGGTTMTGMGGCATATTCCTAMTGCMTGCATTTACIT^ 

TTTGAGGAGCTTCCTCTCATGMTGCCGTGCriTCTCTCCCACAGACT 

ATATGACCTGACTGCCATGAMGTGCCTACrGCrATTTGGGCTGGTGGACATGATGTCCT 

CGTMCACCCCAGGATGTGGCCAGGATACTC^ 

GCTATTGCCAGATTGGAACCACTTTGATTTTGrCTGGG^ 

GTACAGTGAMTCATAGCTTTMTGMGGCATA^ 

[A,G] 

ATTAAMGTTGCITCCMGCCCATMGGGACrrTAGAAAAM 

TTGTCCCCCAGCACCCTGGGGGAGATGCACAGTGGAGTCTG I I 1 1 CCAAGTCAATTGTGT 
TAGTGTTATTTATGTTTAGAGACAT^ 

ATGAGGTAGATTAGGCAAAMGATAMCAAGTTGCTACTCTATCrGGC^ 
TTAAATTGTAA I I I I I AGGGCATACG\TGAAGTATAGAAATGTCrGAAGCTTCAAAGGAA 

AGAGTCATCCCCCTATATATGACCTGACrGCCATGAAAGTGCCTACrGCrATTTGGGCT 

GTG(^CATGATGTCCTCGTM(^CCC(^GGATGTGGC(^GGATACTCCCrCAAATCAAGA 

CTCTTCATTACTTTMGCTATT^ 

ATGCCCCTGVACGGATGTACAGTGAMTCATAGCITTMTGAAGGCATATTCCT^ 

MTGCATTTACTTTTCMTTAAMGTT^ 

[G,A] 

GTMCCAACMTGAGGTTGTCCCCCAGCACCCTGGGGGAGATGCACAGTGGAGTCT 
TCCAAGTCAATTGTGTT AGTGTITATTTATGTTTAGAGACATCTTTGCArGGGACCATCTA 
(^GGTCCTTATAMC^TGAGGTAGATTAGGOWWVGATAM 
TGGCATTTMGTCTAATTAAATTGTAA I 1 1 I I AGGGCATACCATGAAGTATAGAAATGTC 
TGMGaTOW\GGMCAGTGAMTTCOTTMGGTCCTAW 

GA(^TOTTGCATGGGACCATCrACAGGTCCTTATAAACMTGAGGTAGA 
AGATAMCMGTTGCrACTCTATCTGGO^TTTMGTCTMTTAMTTGTM I I I I IAGGG 
CATACCATGMGTATAGAMTGTCTGMGCTTCAMGGMCAGTGAM 
CCTATATGGAMCCTCTGTTGTCATTTTATTTATATGG^ 

TGTGGGATTAGGAGGAGGGCCTGTAAL I PCI I IATAAAACI I I CI I AGCTATCCTGAAGA 
[T,C] 

GTATAGACA I 1 1 I I AC I I I I I I AGGTATTTTCMCATCAGAMTTCAAAAAAGTCCCCAA 
AGATTCTTCCAGAGAAGCCCTCI I I ICI I ACAATCTTATCCCTGGCTATCTGCGTAAACG 
GMTCTTGAACCCATMTAGGATACATGTATAAMTCTTCaTA^ 
TTGTACAGCATCMTATCATTTTATMTCATAGGGAGGC I ICI I ICI I I AGCATGTAATG 
CCCCCTTTACAGGC I I I I I G I I C I I I GAGGGGTTTGMCATTCCATGAAAAACTGACAGA
21156 AGGLI ILI I IGI 1 1 AGCATGTMTGCCCCCTTTACAGGG I ITTTC1 ILI I IGAGGGGTTT
GMCATTCCATGAAAMCTGACAGATAGGAMCTGACMTAAMGATTGAGCT 
GMGCAGAAAGTACTAGGCTAGATACTCTCTAAACATTAAGTA I I I ILI ICCTCCATCTT 
AAMGCMTGAGMGCCACCAAMTATTTTACCTMTGGAMCCTGATTGCCGCA 1 1 1 I I 
GTAACCACCACTTTGGCTGCT ACATAGAGMTGGATTAGMGATGCCAACAAAAGATTCr 
[G,C] 

AGCMGTCrGTAMTCTGATCMGTGTT^ 
GAGATGATCCTTGGAAMTCG^GAGCCAGCT 
TCCACAAGCTGCTGGCCCCTGGAGCCATTCTTCTCTCAAMCT 
TGTATACCTATTGATGGGGMTMTGGrCACrATGAAMCCATGTGATMTATGGAAAAA 
TACCG\TGATATMTGriTATGrGMGAGAAGAAMTGAMCTG(7rAGAACTATGTGATTG 

I I IGI I I AGCATGTAATGCCCCCTTTACAGGC I I I I IGI ICI I I GAGGGGTTTGAACATT 
CCATGAAAMCTGACAGATAGGAMCTGACAATAAAAGATTGAGCTAMGATQ 
AAAGTACTAGGCTAGATAGTCrCTAAAG\TTAACTA I 1 1 ICI I CCTCCATCTTAAAAGCA 
ATGAGAAGCCACCAAMTATTTTACCTMTGGAAACCTGATTGCCGCA I I 1 1 1 GTAACCA 
CCACTTTGGCTGCTACATAGAGAATGGATTAGAAGATGCCAACAAM 
[A,T] 

CTGTAAATCTGATCAAGTGTT CT GATGCAGGCTGATATCCTTCTGT GCTAAGAGAGATGA 
TCCTTGGAAMTCCAGAGCCAGCTCCATMTAC1TTCCTG 
GCTGCTGGCCCCTGGAGCCATTCTTCrCTCAAAACTAGCAT^ 
GTATTGATGGGGAATAATGGTCACTATGAAMCCATGTGA 
GATATAATGTTATGTGAAGAGMGAAAATGAA^ 

MTGGATTAGAAGATGCCAACAAMGATTCTG^ 
CTGATGCAGCCTTSATATCCTTCTCT 
GCTCCATAATACTTTCCTGCTCTGCTC^^ 
TCTTCTCTCVWVACTAGCATTCATCMTTTMTCT 

CACTATGAAMCCATGTGATAATATGGAAAAATACCO\TGATATAATGlTATGrGAAGAG 
[G,A] 

AGAAMTGAMCTGGTAGMCTATGTGATTGCAMTATATAO\M 
ATGACTITATAAAATATTTGTATATMTGAAMCrGMGCM 

TTGTGTCAGGGTAGTAACATGATGAGTGATTAATAG I I I I IAAI I I I I AATATAGTAATG 
ACATMTGTTACMCTTGTCCAMTCTC^CAMCATMTATTCA 
ATAAAAGAATACATATTTTATTATACAI I I I I ATGTAGGCTAATTGATGGTTCTGAAAGC